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DNA DIAGNOSTICS BASED ON MASS SPECTROMETRY 
Related Applications 

For U.S. National Stage purposes, this application is a 
5 continuation-in-part of U.S. application Serial No. 08/744,481, filed 
November 6, 1996, to Koster, entitled "DNA DIAGNOSTICS BASED ON 
MASS SPECTROMETRY-. This application is also a continuation-in-part 
of U.S. application Serial Nos. 08/744,590. 08/746,036. 08/746.055. 
08/786.988. 08/787.639, 08/933.792 and U.S. application Serial No. 
10 atty dkt. no. 7352-2001 8. filed October 8, 1 997. which is a 

continuation-in-part of U.S. application Nos. 08/746,055, 08/786.988 
and 08/787.639. For international purposes, benefit of priority Is 
claimed to each of these applicati'ons. 

This application is related to U.S. Patent Application Serial No. 
15 08/617,256 filed on March 18, 1996. which is a continuation-in-part of 
U.S. application Serial No. 08/406,193, filed March 17, 1995. now U.S. 
Patent No. 5,605,798, and is also related U.S. Patent Nos. 5.547,835 
and 5,622.824. 

Where permitted the subject matter of each of the above-noted 
20 patent applications and the patent is herein incorporated in Its entirety. 
BACKGROUND OF THE INVENTION 
Detection of mutations 

The genetic information of all living organisms (e.g.. animals, 
plants and microorganisms) Is encoded in deoxyribonucleic acid (DNA). 
25 In humans, the complete genome is contains of about 100,000 genes 
located on 24 chromosomes (The Human Genome, T. Strachan, BIOS 
Scientific Publishers, 1992). Each gene codes for a specific protein, 
which after its expression via transcription and translation, fulfills a 
sp cific bioch mical function within a living cell. Changes in a DNA 
30 s quence are known as mutations and can result in proteins with altered 
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or in some cases even lost biochemical activities; this in turn can cause 
genetic disease. Mutations include nucleotide deletions, insertions or 
alterations (i.e. point mutations). Point mutations can be either 
-missense-, resulting in a change in the amino acid sequence of a protein 
5 or -nonsense- coding for a stop codon and thereby leading to a 
truncated protein. 

More than 3000 genetic diseases are currently known (Human 
Genome Mutations, D. N. Cooper and M. Krawczak, BIOS Publishers, 
1 993), including hemophilias, thalassemias, Duchenne Muscular 
10 Dystrophy (DMD), Huntington's Disease (HD). Alzheimer's Disease and 
Cystic Fibrosis (CF). In addition to mutated genes, which result in 
genetic disease, certain birth defects are the result of chromosomal 
abnormalities such as Trisomy 21 (Down's Syndrome), Trisomy 13 
(Patau Syndrome), Trisomy 18 (Edward's Syndrome), Monosomy X 
1 5 (Turner's Syndrome) and other sex chromosome aneuploidies such as 
Klienfelter's Syndrome (XXY). Further, there is growing evidence that 
certain DNA sequences may predispose an individual to any of a number 
of diseases such as diabetes, arteriosclerosis, obesity, various 
autoimmune diseases and cancer (e^ colorectal, breast, ovarian, lung). 
20 Viruses, bacteria, fungi and other infectious organisms contain 

distinct nucleic acid sequences, which are different from the sequences 
contained In the host cell. Therefore, infectious organisms can also be 
detected and identified based on their specific DNA sequences. 
Since the sequence of about 16 nucleotides is specific on 
25 statistical grounds even for the size of the human genome, relatively 

short nucleic acid sequences can be used to detect nonnal and defective 
g nes in higher organisms and to detect infectious microorganisms (e^. 
bacteria, fungi, protists and yeast) and viruses. DNA sequences can 
even serve as a fingerprint for det ction of different individuals within the 
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same species (see, Thompson, J. S. and M. W. Thompson, eds.. 
Genetics in Medioinft. W.R. Saunders Co., Philadelphia, PA (1991)). 

Several methods for detecting DNA are currently being used. For 
example, nucleic acid sequences can be identified by comparing the 
5 mobility of an amplified nucleic acid fragment with a known standard by 
gel electrophoresis, or by hybridization with a probe, which is 
complementary to the sequence to be identified. Identification, however, 
can only be accomplished If the nucleic acid fragment is^ labeled with a 
sensitive reporter function (e.g.. radioactive (^^P, ^^S), fluorescent or 
10 chemiluminescent). Radioactivie labels can be hazardous and the signals 
tiiey produce decay over time. Non-isotopic labels (e.g.. fluorescent) 
suffer from a lack of sensitivity and fading of the signal when high 
intensity lasers are being used. Additionally, performing labeling, 
electrophoresis and subsequent detection are laborious, time-consuming 
1 5 and error-prone procedures. Electrophoresis is particularly error-prone, 
since the size or the molecular weight of the nucleic acid cannot be 
directiy correlated to the mobility in the gel matrix. It is known that 
sequence specific effects, secondary structure and interactions with the 
gel matrix are causing artefacts. 
20 Use of mass spectrometry for detection and identification of nucleic acids 
Mass spectrometry provides a means of "weighing" individual 
molecules by ionizing the molecules in vaccuo and making them "fly" by 
volatilization. Under the influence of combinations of electric and 
magnetic fields, the ions follow trajectories depending on their individual 
25 mass (m) and charge (z). In the range of molecules with low molecular 
weight, mass spectrometry has long been part of the routine physical- 
organic r pertoire for analysis and characterization of organic molecules 
by th determination bf the mass of the par nt molecular ion. In 
addition, by arranging collisions of this parent molecular ion with other 
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25 



particles (e^ argon atoms), the molecular ion is fragmented forming 
secondary Ions by the so-called collision induced dissociation (CID). The 
fragmentation pattern/pathway very often allows the derivation of 
detailed structural information. Many applications of mass spectrometric 
methods are known in the art, particulariy in biosciences (see e.g.. 
iVIethods in Fnrymol Vol. 193: -Mass Spectrometry- (J. a. McCloskey 
editor), 1990, Academic Press, New York). 

Because of the apparent analytical advantages of mass 
spectrometry in providing high detection sensitivity, accuracy of mass 
measurements, detailed structural information by CID in conjunction with 
an MS/MS configuration and speed, as well as on-line data transfer to a 
computer, there has been interest in the use of mass spectrometry for 
the structural analysis of nucleic acids. Recent reviews summarizing this 
field include K.H. Schram, "Mass Spectrometry of Nucleic Acid 
1 5 Components, Biomedical Applications of Mass Spectrometry- 34, 
203-287 (1990); and P.P. Crain, -Mass Spectrometric Techniques in 
Nucleic Acid Research.- Mass Sn^ntro.^.^ a^..-.^,,,. ^ 505-554 (1990)- 

see, also U.S. Patent No. 5,547,835 and U.S. Patent No. 5,622,824). 

Nucleic acids, however, are very polar biopolymers that are very 
difficult to volatilize. Consequently, mass spectrometric detection has 
been limited to low molecular weight synthetic oligonucleotides for 
confirming an already known oligonucleotide sequence by determining 
the mass of the parent molecular Ion, or alternatively, confirming a 
known sequence through the generation of secondary ions (fragment 
•ons) via CID in an MS/MS configuration using, in particular, for the 
ionization and volatilization, the method of fast atomic bombardment 
(FAB mass sp ctrometry) or plasma desprption |PD mass spectrometry) 
As an example, the application of FAB to the analysis of protected 
dimeric blocks for chemical synthesis of oligodeoxynucleotides has been 



20 
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described (Koster et aL ( 1 987) Bfomed. Environ. Ma«c SDectrnm^try 14, 
111-116). 

Other ionization/desorption techniques include electrospray/ion- 
spray (ES) and matrix-assisted laser desorption/ionization (MALDf). ES 
5 mass spectrometry has been introduced by Fenn et aL (J. Rhy s, nhom 
88:4451-59 (1984); PCT Application No. WO 90/14148) and current 
applications are summarized in review articles (see, e.g.. Smith et al. 
(1990) Anal. Chem. 62:882-89 and Ardrey (1992) Electrospray Mass 
Spectrometry, Spectroscopy Europe 4:10-18). The molecular weights of 
10 a tetradecanucleotide (see. Covey et ah (1 988) The "Determination of 

Protein, Oligonucleotide and Peptide Molecular Weights by lonspray Mass 
Spectrometry." Rapid Commun . in Mass Soectromfltry 9.0^0-71^^} 
of a 21-mer (Methods I n Enzvmol.. 193. "Mass Spectrometry" 
(McCloskey, editor), p. 425, 1990, Academic Press, New York) have 
15 been published. As a mass analyzer, a quadrupole is most frequently 
used. Because of the presence of multiple ion peaks that all could be 
used for the mass calculation, the determination of molecular weights in 
femtomole amounts of sample is very accurate. 

MALOl mass spectrometry, in contrast, can be attractive when a 
20 time-of-flight (TOP) configuration (see, Hillenkamp et aL ( 1 990) pp 49-6O 
in "Matrix Assisted UV-Laser Desorption/ionization: A' New Approach to 
Mass Spectrometry of Large Biomoiecules," Biolooical Maco 
Spectrometry, Buriingame and McCloskey, editors, Elsevier Science 
Publishers, Amsterdam) is used as a mass analyzer. Since, in most 
25 cases, no multiple molecular ion peaks are produced with this technique, 

the mass spectra, in principle, look simpler compared to ES mass spectrometry. 



wo 98/20166 



PCT/US97/20444 



-6- 



Although DNA molecules up to a molecular weight of 410,000 
daltons have been desorbed and volatilized (Williams etaf., "Volatilization 
of High Molecular Weight DNA by Pulsed Laser Ablation of Frozen 
Aqueous Solutions," SsieQcg 246, 1 585-87 (1 989)), tills technique had 
5 only shown very low resolution (oligotfiymidylic acids up to 1 8 

nucleotides. Hutii-Fehre gt aL Rapid Commun M..« ^r-*-r^xn 6, 
209-13 (1992); DNA fragments up to 500 nucleotides in lengtii K. Tang 
era/.. Rapid Commun in Mass SpArtron,^ , 8, 727-730.(1994); and a 
double-stranded DNA of 28 base pairs (Williams a/., -r,me-of-Right 
10 Mass Spectrometry of Nucleic Acids by^User Ablation and Ionization 
from a Frozen Aqueous Matiix," Rapid Comm..n ;„ m^o. gp^^trrrm 4 
348-351 (1990)). Japanese Patent No. 59-131909 describes an ~' 
instrument, which detects nucleic acid fragments separated either by 
electrophoresis, liquid chromatography or high speed gel filtration. Mass 
15 spectrometi^ic detection is achieved by incorporating into the nucleic 

acids, atoms, such as S. Br, I or Ag, Au, Pt, Os. Hg, tiiat normally do not 
occur in DNA, 

Co-owned U.S. Patent No. 5.622,824 describes metiiods for DNA 
sequencing based on mass specti-oitieti^ic detection. To achieve tiiis, the 
20 DNA is by means of protection, specificity of enzymatic activity, or 
immobilization, unilaterally degraded in a stepwise manner via 
exonuclease digestion and the nucleotides or derivatives detected by 
mass spectrometry. Prior to the enzymatic degradation, sets of ordered 
deletions tiiat span a cloned DNA fragment can be created. In tiiis 
25 manner, mass-modified nucleotides can be incorporated using a 

combination of exonuclease and DNA/RNA polymerase. This permits 

ither multiplex mass spectrometric detection, or modulation of tiie 
activity of tiie exonuclease so as to synchronize tfie d gradative process. 
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Co-owned U.S. Patent Nos. 5,605,798 and 5,547,835 provide methods 
for detecting a particular nucleic acid sequence in a biological 
sample. Depending on the sequence to be detected, the processes can 
be used, for example, in methods of diagnosis. These methods, while 
5 broadly useful and applicable to numerous embodiments, represent the 
first disclosure of such applications and can be improved upon. 

Therefore, it Is an object herein to provided improved methods for 
sequencing and detecting DNA molecules in biological samples. It is also 
an object herein to provided improved methods for diagnosis of genetic 
10 diseases, predispositions to cetain diseases, cancers, and infections. 
SUMMARY OF THE INVENTION 

Methods of diagnosis by detecting and/or determing sequences of 
nucleic acids that are based on mass spectrometry are provided herein. 
Methods are provided for detecting double-stranded DNA, detecting 
15 mutations and other diagnostic markers using MS analysis. In particular, 
methods for diagnosing neuroblastoma, detecting heredity relationships, 
HLA compatibility, genetic fingerprinting, detecting teleromase activity 
for cancer diagnosis are provided. 

In certain embodiments the DNA is immobilized on a solid support 
20 either directly or Via a linker and/or bead. Three permiitions of the 
methods for DNA detection in which immobilized DNA is used are 
exemplified. These Include: (1) immobilization of a template; 
hybridization of the primer; extension of the primer, or extension of the 
primer (single ddNTP) for sequencing or diagnostics or extension of the 
25 primer and Endonuclease degradation (sequencing); (2) immobilization of 
a primer; hybridization of a single stranded template; and extension of 
th primer, or extension f the primer (single ddNTP) for sequencing or 
diagnostics or extension of the primer and Endonuclease degradation 
(sequencing): (3) immobilization of th primer; hybridization of a double 
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stranded template; extension of the primer, or extension of the primer 
(single ddNTP, for sequencing or diagnostics or extension of the primer 
and Endonuclease degradation (sequencing). 

ONA la imn..bi.ized on *e support via 
a selec«velv cleavabte linker. Selacavely cleevabla linkers inCuda buu 

are no, ,i„,i,ed ,o photocleavable linkers, chemically cleavable linkers and 
an enzymatfcally (such as a restrioSon she (nucleic acid linker) a 
protease She) Cleavable linker., 'ndusion of a selectively cleaJable linker 
expands the capabilities of the MALDl-TOF MS analysis because „ allows 
for all of *e pemiutations of immobiliation of DNA for MALOl-TOF MS 
the ONA linkage ,o the support trough the 3'- or S'^nd of a nucleic ' 
acid; allows the amplified DNA or the target pdmer to bo extended by 
DNA synthesis: and further allows for «,e mass of the extended product 
(or degraded product via exonuclease degradation) to be of a size that is 
appropriate for MAU„-TOF MS analysis (i.e.. the Isolated or syn«,esi„d 
DNA can be large and a small primer or a large primer sequence can be 
"sed and a small restriction fragment of a gene or single strand thereof 
; nybndized thereto). 

m a preferred embodiment, the selectively cleavable linker is a 
20 chemical or photocleavable linker that is cleaved during the ionizing step 
Of mass spectrometry. Exemplary r,nke« Include linkers containing, a 
disulfide group, a leuvinyl group, an acid^ebile trityl group and a 
hydrophobic trityl group, in other embodiments, the enzymatically 
Cleavable linker can be a nucleic «M tha, is an RNA nuclaottde or the, 
25 ancodes a restriction endonuclease Site. 0«,er enzymatically cleavable 
•.nkers include linkers «,a. contain a pyrophosphate group, an arginine- 
arg-nme group and a lysine-lysine group, other link rs are exemplified 
n r in. 
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Methods for sequencing long fragments of DNA are provided. To 
perform such sequericing, specific base terminated fragments are 
generated from a target nucleic acid. The analysis of fragments rather 
than the full length nucleic acid shifts the mass of the ions to be 
5 determined into a lower mass range, which is generally more amenable to 
mass spectometric detection. For example, the shift to smaller masses 
increases mass resolution, mass accuracy and, in particular, the 
sensiti^nty for detection- Hybridization events and the- actual molecular 
weights of the fragments as determined by mass spectrometry provide 

10 sequence information {e.g.. the presence and/or identity of a mutation). 
In a preferred embodiment, the fragments are captured on a solid support 
prior to hybridization and/or mass spectrometry detection. In another 
preferred embodiment, the fragments generated are ordered to provide 
the sequence of the larger nucleic acid. 

15 One preferred method for generating base specifically terminated 

fragments from a nucleic acid is effected by contacting an appropriate 
amount of a target nucleic acid with an appropriate amount of a specific 
endonuclease, thereby resulting in partial or complete digestion of the 
target nucleic acid. Endonucleases will typically degrade a sequence into 

20 pieces of no more than about 50-70 nucleotides, even if the reaction is 
not run to full completion. In a preferred embodiment, the nucleic acid is 
a ribonucleic acid and the endonuclease is a ribonucle.ase (RNase) 
selected from among: the G-specific RNase T,, the A-specific RNase Uj, 
the AAJ specific RNase PhyM, U/C specific RNase A, C specific chicken 

25 liver RNase (RNase CL3) or crisavitin. In another preferred embodiment, 
th endonuclease is a restriction enzyme that cleaves at least one site 
contained within the target nucleic acid. Another preferred method for 
g n rating base specifically terminated fragments includes performing a 
c mbined amplification and base-specific t rmination reaction ( e.g. . using 
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an appropriate amount of a first DNA polymerase wh.VK k 

e.Ponen«a.,.p„fic,«o„ of *e «.«et; and a polymerase a reL^ . 
h-ah afflruty for *e chain terminating nuCaotide reaul«„„ ^ " 
S termination Of *epo.y„«.««on. .nc.„aio„ra ^r^T"'"'' 
end of a «r,e. ™o,aic acid can ,aoi.a^ ti,e ordari:;lft 

Methods for determining the sequence of «n .. aments. 
In wHioH «,e S- and/or 3- end of «,e ^121^"" '^'^ 
are provided. .nCueion of a non-na. Jtag on I 3 e„^ • T"" 

10 ou. or con^neatina for ti,einf,J„ce "3 nlllt^^^^^ 

P«n,a.re ..m,ination and nonapecifio eiongation. ,n ^^ZT 
emb<«,™ent ^ «a is an affinity ts, te^ „oti„ or a nu^'cid .h . 

fa^^T^d^T «.e:;:;ir 

facilitates bindina of «,e nucleic acid to a solid support l„ Z 
pn^ferred en,5odin«m. the tag is a mass marlcer ^ a „, r^ T 
*a. does not correspond .0 the mass of any of ^four :1::::T 

natural 3 heterogeneity that can result, for examni*. f 

reaction. example, from a transcription 

'''' replicatlT' T"" " ""^'^'^ -^^^ '^-e been 

repLcated from a nucleic acid molecule obtained from a biolooica. . 

are specifically digested using one or more nucleaseT " "'^'^ 

(deoxynbonucleases for DNA, and ribonucleases for RNA, are provided 
The fragments captured on a solid support carrvina th. P~^'^«<*- 
as .mplementary se,uences. Hyhridi^aTon e^rr::: "^^ 
molecular weights of ti,e captirred targe, sequences provide in,om»ti 
n mutations in the gene. T^e array can be analy. d spo^K ' " 
mass spectiometry. Further. ti,e fragments aene™.«. t 

Vide the sequence Of the larger ta g . "aZr " 
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In another embodiment, at least one primer with a 3'-terminal base 
is hybridized to the target nucleic acid near a site where possible 
mutations are to be detected. An appropriate polymerase and a set of 
three nucleoside triphosphates (NTPs) and the fourth added as a 
terminator are reacted. The extension reaction products are measured by 
mass spectrometry and are indicative of the presence and the nature of a 
mutation. The set of three NTPs and one dd-NTP (or three NTPs and one 
3'-deoxy NTP), will be varied to be able to discriminate between several 
mutations (including compound heterozygotes) in the target nucleic acid 
sequence. 

Methods for detecting and diagnosing neoplasia/malagnancies in a 
tissue or cell sample are provided- The methods rely on a 
telomeric repeat amplification protocol (TRAP) -MS assay and include the 
steps of: 

a) obtaining a tissue or a cell sample, such as a clinical 
isolate or culture of suspected cells; 

b) isolating/extracting/purifying telomerase from the 
sample; 

c) ^ adding the telomerase extract to a composition 

containing a synthetic DNA primer, which is 
optionally immobilized, complementary to the 
telomeric repeat, and all four dNTPs under conditions 
that result in telomerase specific extension of the 
synthetic DNA; 

d) amplifying the telomerase extended DNA products,, 
preferably using a primer that contains a "linker 
moiety", such as a moiety based on thiol chemistry or 
streptavidin; 
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e) 



isolating linker-amplified primers, such as by using a 
complementary binding partner immobilized on a solid 
support; 

f) optionally conditioning the DNA for crystal formation; 



and 



g) performing MS by ionlzlng/volatizing the sample to 
detect the DNA product. 
Telomerase-specific extension is indicative of neoplaisa/malignancy 
ms method can be used to detect ect specific malignancies. The use 
10 of MS to detect the DNA product pemiits identification the extended 
product, which is indicative of telomerase activity in the sample 
If desired, the synthetic DNA can be In the fonn an array. 

Methods for detecting mutations are provided and ihe use thereof 
oncogenes and to thereby screen for transformed cells, which are 
15 indicative Of neoplasia. Detection of mutations present in oncogenes are 
indicative of transformation. This method Includes the steps of: 

a) obtaining a biological sample; 

b) amplifying a portion of the selected proto-oncogene 

includes a codon indicative of transformation, 
where one primer has a linker moiety for 
immobilization; 

c) immobilizing DNA via the linker moiety to a solid 
support, optionally in the form of an array; 

d) hybridizing a primer complementary to the proto 
oncogene sequence that is upstream from the codon 

e) adding 3dNTPs/1 ddNTP and DNA 
polymerase and extending the 
hybridized primer to the next 
ddNTP location; 
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f) ionizing/vblatizing the sample; and 

g) detecting the mass of the extended DMA, whereby 
mass indicates the presence of wild-type or mutant 
alleles. The presence of a mutant allele at the codon 

5 is diagnostic for neoplasia. 

In an exemplary embodiment, extension-MS analysis is used detect the 
presence of a mutated codon 634 in the retrovirus (RED-proto oncogene. 

In another embodiment, methods for diagnosing diseases using 
reverse transcription and amplification of a gene expressed in 

10 transformed cells, in particular, a method for diagnosis of 

neuroblastoma using reverse transcriptase (RT)-MS of tyrosine 
hydroxylase, which is a catecholamine biosynthetic enzyme that 
expressed in tumor cells, but not in tumor cells but not normal cells, such 
as nonnal bone marrow cells is provided. The method includes the steps 

15 of: 

a) obtaining a tissue sample; 

b) isolating polyA RNA from the sample; 

c) preparing a cDNA library using reverse transcription; 

d) amplifing a cDNA product, or portion thereof, of the 
selected gene, where one oligo primer has a linker 
moiety; 

e) isolating the amplified product by immobilizing the 
DNA to solid support via the linker moiety; 

f) optionally conditioning the DNA: 

g) ionizing/volatizing sample and detecting the presence 
f a DNA peak that is indicative of expression of the selected gene gene. 

For xample, expression of the tyrosine hydroxylase gene is indicative of 
neuroblastoma. 



20 



25 
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Also provided are me*ods of directiy detecting a double-stranded 
nuc,e.c aoid using MALD^TOF MS. These n,e.hods inoiude the steps 

a) .solating a double stranded DNA of an appropriate si2e for ' 
MS via amplification methods or fom»d by hybridization of 
Single-Stranded DNA fragment; 

b) preparing the double-stranded DNA for analysis under 
conditions that Increase the ratio of dsDNArssDNA In which 
the conditions include one or all of the following: preparing 
samples for analysis at reduced temperatures (i^ 4 - c) 
and using of higher DNA concentrations in the matrix to ' 
drive duplex formation 

0 ionizing/volatizing the sample of step b), where this step 
uses low acceleration voltage of the ions to assist in 
maintaining duplex DNA by. for example, adjusting laser 

P°^«^*°i"st above threshold irradiation for ionization, and 
d) detecting the presence of the dsDNA of the appropriate 

mass. 



: ^preferred embodiments. a,e matrix Includes 3-hydroxypico.inic acid 
Tfe detected DNA can be i^licadve of a gene«c disorder. gene«c ' 
d,sease. genebc predisposition to a disease chromosomal abnonnalibes 

If n r'^'^'*- ™" "-"'e ^'-n-e- DNA is indicative 

ot the deletion, insertion, mutation. 

A method designated primer oligo base extension (PROBE) is 
provided. This method uses a single detecUon primer followed by an 
2S ol,gonuc.e.«de «cension s«.p to give products, which can be readily 
resolved by MALOI-TOF mass spectrometry. T„e pr«,uc.s differ in 
'ength byanumber ' bases specific for a numb r o, repeat units r for 
second site mutations within the repeated region. The me«,od is 
exemphfied using as a model system the AluVpA polymorphism in intron 
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5 of the interferon-a receptor gene located on human chromosome 21 , 
and the poly T tract of the splice acceptor she of intron 8 from the CFTR 
gene located on human chromosome 7. The method is advantageously 
used for example, for determining identity, identifying mutations, familial 
6 relationship, HLA compatability and other such markers ,using PROBE-MS 
analysis of microsatellite DNA. In a preferred embodiment, the method 

includes the steps of: 

a) obtaining a biological sample from two individuals; 

b) amplifying a region of DNA from each individual that 
contains two or more microsatellite DNA repeat sequences 

c) ionizing/volatizing the amplified DNA; 

d) detecting the presence of the amplified DNA and comparing 
the molecular weight of the amplified DNA. Different sizes 
are indicative of non-identity (Ls^ wild-type versus 

15 mutation), non-heredity or non-compatibility; similar size 

fragments indicate the possibility identity, of familial 
relationship, or HLA compatibility. 
More than one marker may be examined simulataneoulsy. primers 
with different linker moieties are used for immobilization. 
20 Another method loop-primer oligo base extension, designated 

LOOP-PROBE, for detection of mutations especially predominant disease 
causing mutations or common polymorphisms is provided. In a particular 
embodiment, this method for detecting target nucleic acid in a sample, 

includes the steps of : 
26 a) amplifying a target nucleic acid sequence, such as ^-globin, 

in a sample, using (i) a first primer whose 5'-end shares 
identity to a portion of the target DNA immediately 
downstream from the targeted codon followed by a 
sequence that introduces a unique restriction endonuclease 
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site, such as Cfol in the case of jff-globin, into the amplicon 
and whose 3'-end primer is self-complementary; and (ii) a 
second downstream primer that contains a tag, such as 
biotin. for immobilizing the DNA to a solid support, such as 
streptavidin beads; 

c) immobilizing the double-stranded amplified DNA to a solid 
support via the linker moiety; 

d) denaturing the immobilized DNA and Isolgting the non- 
immobilized DNA strand; 

e) annealing the Intracomplementary sequences in the 3'-end 
of the isolated non-immobilzed DNA strand, such that the 
3'-end is extendable by a polymerase, which annealing can 
be performed, for example, by heating then and cooling to 
about 37° C, or other^suitable method; 

f) extending the annealed DNA by adding DNA polymerase. 

3 dNTPs/1 ddNTP, whereby the 3'-end of the DNA strand is 
extended by the DNA polymerase to the position of the next 
ddNTP location (i.e., to the mutation location); 

g) cleaving the extended double stranded stem loop DNA with 
the unique restriction endonuclease and removing the 
cleaved stem loop DNA 

i) (optionally adding a matrix) ionizing/volatizing the extended 
product; and 

j) detecting the presence of the extended target nucleic acid, 
whereby the presence of a DNA fragment of a mass 
different from wild-type is indicative of a mutation at the 
targ t c don(s). 

This metiiod liminates on specific reagent for mutation detection 

c mpared otfier methods of MS mutational analys s, thereby simplifying 
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the process and rendering it amenable to automation. Also, the specific 
extended product that is analyzed is cleaved from the primer and is 
therefore shorter compared to the other methods. In addition, the 
annealing efficiency is higher compared to annealing of an added primer 
5 and should therefore generate more product. The process is compatible 
with multiplexing and various detection schemes ( e.g.. single base 
extension, oligo base extension and sequencing). For example, the 
extension of the loop-primer can be used for generatioff of short 
diagnostic sequencing ladders within highly polymorphic regions to 

10 perform, for example, HLA typing lor resistance as well as species typing. 
in another emodiment, a methods of detecting a target nucleic 
acid in a biological sample using RNA amplification is provided. In the 
method, the target is amplified the target nucleic acid, using a primer that 
shares a region complementary to the target sequence and upstream 

15 encodes a promoter, such as the T7 promoter- A DNA-dependent RNA 
polymerase and appropriate ribonucleotides are added to synthesize RNA, 
which is analyzed by MS, 

improved methods of sequencing DNA using MS are provided. In 
these methods thermocycling for amplification is used prior to MS 

20 analysis, thereby increasing the signal. 

Also provide are primers for use in MS analyses. In particular, 
primers, comprising all or, for longer oligonucleotides, at least about 20, 
preferably about 1 6, bases of any of the sequence of nucleotides 
sequences set forth in SEQ ID NOs. 1-22, 24, 27-38, 41-86, 89, 92, 95, 

25 98, 101-110, 112-123, 126, 128, 129, and primers set forth in SEQ ID 
Nos. 280-287; The primers are unlabeled, and optionally include a mass 
modifying moiety, which is preferably attached to the 5' nd. 
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Other features and advantages of the methods provided herein will 
be further described with reference to the following Rgures, Detailed 
Description and Claims. 
BRIEF DESCRIPTION OF THE FIGURES 
5 FIGURE 1 A is a diagram showing a process for performing mass 

spectrometric analysis on one target detection site (TDS) contained 
w,thin a target nucleic acid molecule (T), which has been obtained from a 
b.ological sample. A specific capture sequence (C) is attached to a solid 
support (SS) via a spacer (S). The capture sequence is chosen to 
10 specifically hybridize with a complementary sequence on the target 

nucleic acid molecule (T), known as the target capture site (TCS) The 
spacer (S) facilitates unhindered hybridization. A detector nucleic acid 
sequence (D), which is complementary to the TDS is then contacted with 
the TDS. Hybridization between D and the TDS can be detected by mass 
15 spectrometry. 

FIGURE 1 B is a diagram showing a process for performing mass 
spectrometric analysis on at least one target detection site (here TDS 1 
and TDS 2) via direct linkage to a solid support. The target sequence (T) 
^ containing the target detection site (TDS 1 and TDS 2) is immobilized to 
a solid support via the formation of a reversible or irreversible bond 
formed between an appropriate functionality (L') on the target nucleic 
acid molecule (T) and an appropriate functionality (L) on the solid 
support. Detector nucleic acid sequences (here D1 and D2), which are 
complementary to a target detection site (TDS 1 or TDS 2) are then 
25 contacted with the TDS. Hybridization between TDS 1 and Dl and/or 
TDS 2 and D2 can be detected and distinguished based on molecular 
weight differenc s. 

FIGURE IC is a diagram showing a process for detecting a 
wildtype (D«^, and/or a mutant (D"-, sequence in a target (T, nucleic acid 
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molecule. As in Figure 1A, a specific capture sequence (C) is attached to 
a solid support (SS) via a spacer (S). In addition, the capture sequence is 
chosen to specifically interact with a complementary sequence on the 
tkrget sequence (T), the target capture site (TCS) to be detected through 
5 hybridization. If the target detection site (TDS) includes a mutation, X, 
etection sites can be distinguished from wildtype by mass spectrometry. 
Preferably, the detector nucleic acid molecule (D) is designed so that the 
mutation is in the middle of the molecule and therefore would not lead to 
a stable hybrid if the wildtype detector oligonucleotide (D*«) Is contacted 
10 with the target detector sequence, as a control. The mutation can 
also be detected if the mutated detector oligonucleotide (D"*") with the 
matching base at the mutated position is used for hybridization. If a 
nucleic acid molecule obtained from a biological sample is heterozygous 
for the particular sequence (i.e. contain D*** and D**« and D*™* will be 

15 bound to the app and O™^ to be detected simultaneously. 

FIGURE 2 is a diagram showing a process in which several 
mutations are simultaneously detected on one target sequence molecular 
weight differences between the detector oligonucleotides D1, D2 and D3 
must be large enough so that simultaneous detection (multiplexing) is 
20 possible. "This can be achieved either by the sequence itself (composition 
or length) or by the introduction of mass-modifying functionalities M1-M3 
into the detector oligonucleotide. 

FIGURE 3 is a diagram showing still another multiplex detection 
format, in this embodiment, differentiation is accomplished by employing 
25 different specific capture sequences which are position-specifically 
immobilized on a flat surface (e^ a 'chip an^ay'). If different target 
s quenc s Tl-Tn are present, their target capture sites TCSI-TCSn will 
interact with complementary immobilized capture sequences Cl-Cn. 
Detection is achieved by employing appropriately mass differentiated 
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detector oligonucleotides Dl-Dn, which are mass differentiated either by 
their sequences or by mass modifying functionalities Ml-Mn. 

FIGURE 4 is a diagram showing a format wherein a predesigned 
target capture site (TCS) Is incorporated into ti,e target sequence using 
5 nucleic acid (Le^ PGR) amplification. Only one strand is captured the 
otiier Is removed (e^ based on tiie interaction between blotin and 
streptavidin coated magnetic beads). If the biotin is attached to primer 1 
tiie otiier sti-and can be appropriately marked by a TCS. Detection is as 
described above through tiie interaction of a specific detector 
10 Oligonucleotide D with tiie corresponding target detection site TDS via 
mass spectrometry. 

FIGURE 5 is a diagram showing how amplification (here ligase 
chain reaction (LCR)) products can be prepared and detected by mass 
spectrometiy. Mass differentiation can be achieved by tiie mass 
15 modifying functionalities (Ml and M2) attached to primers (Pi and P4 
respectively). Detection by mass spectrometry can be accomplished 
d.rectiy (i.e. witiiout employing immobilization and target capturing sites 
(TCS)). Multiple LCR reactions can be performed in parallel by providing 
an ordered array of capturing sequences (C). This format allows 
20 separation of tiie ligation products and spot by spot identification via 
mass spectrometry or multiplexing if mass differentiation Is sufficient. 

FIGURE 6A Is a diagram showing mass spectrometric analysis of a 
nucleic acid molecule, which has been amplified by a transcription 
amplification procedure. An RNA sequence is captured via Its TCS 
25 sequence, so tfiat wildtype and mutated target detection sites can be 
detected as above by employing appropriate detector oligonucleotides 
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FIGURE 6B is a diagram showing multiplexing to detect two 
different (mutated) sites on the same RNA in a simultaneous fashion 
using mass-modified detector oligonucleotides M1-D1 and M2-D2. 

FIGURE 6C is a diagram of a different multiplexing procedure for 
5 detection of specific mutations by employing mass modified 

dideoxynucleoside or 3'-deoxynucieoside triphosphates and an RNA 
dependent DNA polymerase. Alternatively, DNA dependent RNA 
polymerase and ribonucleotide phosphates can be employed. This format 
allows for simultaneous detection of all four base possibilities at the site 
10 of a mutation (X). 

FIGURE 7A is a diagram showing a process for performing mass 
spectrometric analysis on one target detection isite (TDS) contained 
within a target nucleic acid molecule (T), which has been obtained from a 
biological sample. A specific capture sequence (C) is attached to a solid 
15 support (SS) via a spacer (S). The capture sequence is chosen to 

specifically hybridize with a complementary sequence on T known as the 
target capture site (TCS). A nucleic acid molecule that is complementary 
to a portion of the TDS is hybridized to the TDS 5' of the site of a 
mutation (X) within the TDS. The addition of a complete set of 
20 dideoxynucleosides or 3'-deoxynucleoside triphosphates (e.g., pppAdd, 
PPpTdd, pppCdd and pppGdd) and a DNA dependent T)NA or RNA 
polymerase allows for the addition only of the one dideoxynucleoside or 
3'-deoxynucleoside triphosphate that is complementary to X. 

FIGURE 7B Is a diagram showing a process for performing mass 
25 spectrometric analysis to determine the presence of a mutation at a 
potential mutation site (M) within a nucleic acid molecule. This format 
allows for simultaneous analysis f all les (A) and (B) of a double 
stranded target nucleic acid molecule, so that a diagnosis of homozygous 
normal, homozygous mutant or heterozygous can be provided. Allele A 
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and B ^^^^^^ oomp,e.e„«^ o«Bo„uc.eo«es „c, and 

iO> respec^ve.,,. *a. hvbridlze .„ A and B wl«,.„ a .e«ion «,a. .ncL, 
M. Each he.erodap.ex is *en contacted wfth a single strand speciflc 
endonuceaee. so that a nWsn,atcH at M. indicating «.e presence o, a 
mutagen resuits In the cleavage ol ,C, a„d,or ,D,. wHich can then be 
detected by mass spectrometry. 

can be prepared ,or detection using transc„p«„„ vector^ having L 
different promoters at opposite locations fe^ u,e SP6 end T7 

ZTr^ O't-Snghetero^gous 
target detections sites (TDS). Employing the SP6 or the T7 RNA 

polymerase both strands could be transcribed separately or 
s-multaneously. T^e transcribed RNA molecules can be specifically 
captured and simultaneously detected using appropri«elymass- 
d.fferent,a,ed detector Oligonucleotides. This can be accomplished ei.h» 

an ordered array of specifically immobilized cap«,ring scuences 

fiauresT ""r^p ' "T" """""^ "^^ «^ 
20 ,ir„ , ' '"^ « '"^ ^P^cfflcally digested using one or more 
20 nbonucleases and the fragments captured on a solid support carrvino ,h. 
corresponding complementary se<.e„ces. Hybrid.atioWrZ:r 
actual molecular we,gh.s of the captured target sequences provide 
-formation on whether «,d vWwe mutations in the gene are present 

25 cl b"' r '""''^ ^ ^"^ "^"O spectrometry. DNA 
can be s.m.larly digested using a cocwail Of nucleases including 
res«,caon endonucleases. Mutations can be detected by different 
HK. acular weights of specific. indivWua, fragments compar d .o the 
molecular weights of ti« wildtype fragments. 



wo 98/20166 



PCTAJS97/20444 



-23- 

FIGURE 10A shows UV spectra resulting from the experiment 
described in the following Example 1. Panel I) shows the absorbance of 
the 26-mer before hybridization. Panel ii) shows the filtrate of the 
centrifugation after hybridization. Panel iii) shows the results after the 
B first wash with 50 mM ammonium citrate. Panel iv) shows the results 
after the second wash with 50 mM ammonium citrate. 

- FIGURE 10B shows a mass spectrum resulting from the experiment 
described in the following Example 1 after three washing/centrifugation 
steps. 

10 FIGURE IOC shows a mass spectrum resulting from the 

experiment described in the following Example 1 showing the successful 
desorption of the hybridized 26-mer off of beads in accordance with the 
format depicted schematically in Rgure 1 B. 

FIGURE 1 1 shows a mass spectrum resulting from the experiment 
15 described in the following Example 1 showing the giving proof of an 

experiment as schematically depicted in FIGURE IB successful desorption 
of the hybridized 40-mer. The efficiency of detection suggests that 
fragments much longer than 40-mers can also be desorbed.Figure 12 
shows a mass spectrum resulting from the experiment described in the 
20 following Example 2 showing the successful desorption and 
differentiation of an 18-mer and 19-mer by electrospray mass 
spectrometry, the mixture (top), peaks resulting from 1 8-mer emphasized 
(middle) and peaks resulting from 1 9-mer emphasized (bottom) 

FIGURE 13 is a graphic representation of the process for detecting 
25 the Cystic Rbrosis mutation AF508 as described in Example 3. 

FIGURE 14 is a mass specti^um of the DNA extension product of a 
AF508 homozygous normal of Example 3. 

FIGURE 15 is a mass spectrum of the DNA extension product of a 
AF508 heterozygous mutant of Example 3. 
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FIGURE 16 Is a mass spectrum of the DNA extension product of 
AF508 homozygous normal of Example 3. 

FIGURE 1 7 is a mass spectrum of the DNA extension product of 
AF508 homozygous mutant of Example 3, 
S FIGURE 1 8 is a mass spectrum of the DNA extension product of . 

AF508 heterozygous mutant of Example 3. 

FIGURE 1 9 is a graphic representation of various processes for 
performing apolipoprotein E genotyping of Example 4. _ 

FIGURE 20 shows the nucleic acid sequence of normal 
10 apolipoprotein E (encoded by the E3 allele, FIG. 20B) and other isotypes 
encoded by the E2 and E4 alleles (FIG. 20A). 

FIGURE 21 A shows a composite restriction pattern for various 
genotypes of apolipoprotein E using the Cfol restriction endonuclease. 
FIGURE 21 B shows the restriction pattern obtained in a 3 5% 
1 5 MetPhor Agarose Gel for various genotypes of apolipoprotein E. 

FIGURE 21 C shows the restriction pattern obtained in a 1 2% 
polyacrylamide gel for various genotypes of apolipoprotein E. 

FIGURE 22A is a chart showing the molecular weights of the 9 1 
83. 72, 48 and 35 base pair fragments obtained by restriction enzyme ' 
20 cleavage of the E2, E3 and E4 alleles of apolipoprotein E. 

FIGURE 22B is the mass spectrum of the restriction product of a 
homozygous E4 apolipoprotein E genotype, 

FIGURE 23A is the mass spectrum of the restriction product of a 
homozygous E3 apolipoprotein E genotype. 
25 FIGURE 23B is the mass spectrum of the restriction product of a 

E3/E4 apolipoprotein E genotype. 

FIGURE 24 is an autoradiograph of Example 5 of a 7.5% 
polyacrylamide gel in which 10% (Bpl) of each amplified sample was 
loaded: samfiL M: pBR322 A/uf digest d; sarnfilgj..- HBV positiv in 
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serological analysis; sample 2 : also HBV positive; sample 3 : without 
serological analysis but with an increased level of transaminases, 
indicating liver disease; sample 4 : HBV negative containing HCV; sample 
5: HBV posit-) negative control; ( + ) positive control). Staining was done 
5 with ethidium bromide. 

FIGURE 25A is a mass spectrum of sample 1 , which Is HBV 
positive. The signal at 20754 Da represents the HBV related 
amplification product (67 nucleotides, calculated mass: "20735 Da). The 
mass signal at 10390 Da represents the [M + 2H12* molecule ion 
10 (calculated: 10378 Da). 

FIGURE 25B is a mass spectrum of sample 3, which is HBV 
negative corresponding to nucleic acid (i.e., PGR), serological and dot 
blot based assays. The amplified product is generated only in trace 
amounts. Nevertheless It is unambiguously detected at 20751 Da 
15 (calculated mass: 20735 Da), The mass signal at 10397 Da represents 
the IM + 2H12* molecule ion (calculated: 10376 Da). 

FIGURE 250 is a mass spectrum of sample 4, which is HBV 
negative, but HCV positive. No HBV specific signals were observed. 

FIGURE 26 shows a part of the £. coli lac\ gene with binding sites 
20 of the complementary oligonucelotides used In the ligase chain reaction 
(LCR) of Example 6. Here the wildtype sequence is displayed. The 
mutant contains a point mutation at bp 191 which is also the site of 
ligation (bold). The mutation is a C to T transition (G to A, respectively). 
This leads to a T-G mismatch with oligo B (and A-C mismatch with oligo 
25 C, respectively). 

FIGURE 27 is a 7.15% polyacrylamide gel of Example 6 stained 
with ethidium bromide. M: chain length standard (pUCISDNA, Msp\ 
digested). Lane 1: LCR with wildtype template. Lane 2: LCR with 
mutant template. Lane 3: (control) LCR without template, the ligation 
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product (50 bp) was only generated in the positive reaction containing 

wildtype template. 

FIGURE 28 is an HPLC chromatogram of two pooled positive LCRs. 
FIGURE 29 shows an HPLC chromatogram the same conditions but 
5 mutant template were used. The small signal of the ligation product is 
due to either template-free ligation of the educts or to a ligation at a (G- 
T, A-C) mismatch. The 'false positive' signal is significantly lower than 
the signal of ligation product with wildtype template depicted in Figure 
28. The analysis of ligation educts leads to 'double-peaks' because two 
10 of the oligonucleotides are 5'-phosphorylated. 

FIGURE 30 In (b) the complex signal pattern obtained by MALDI- 
TOF-MS analysis of Pfu DNA-Iigase solution of Example 6 Is depicted. In 
(a) a MALDI-TOF-spectrum of an unpurified LCR is shown. The mass 
signal 67569 Da probably represents the Pfu DNA ligase. 
15 FIGURE 31 shows a MALDI-TOF spectrum of two pooled positive 

LCRs (a). The signal at 7523 Da represents unligated oligo A (calculated: 
7521 Da) whereas the signal at 1 5449 Da represents the ligation product 
:(calculated: 15450 Da). The signal at 3774 Da is the [M + 2HP* signal 
of oligo A. The signals in the mass range lower than 2000 Da are due to 
20 the matrix ions. The spectrum corresponds to lane 1 in figure 27 and the 
chromatogram in figure 28. In (b) a spectrum of two pooled negative 
LCRs (mutant template) is shown. The signal at 7517 Da represents 
oligo A (calculated: 7521 Da). 

FIGURE 32 shows a spectrum of two pooled control reactions 
25 (with salmon sperm DNA as template). The signals In the mass range 
around 2000 Da are due to Tween20, only oligo A could be detected, as 
xpected. 

FIGURE 33 shows a spectrum of two pool d positive LCRs (a). 
The purification was done with a combination of ultrafiltration and 
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streptavidin DynaBeads as described in the text. The signal at 15448 Da 
represents the ligation product (calculated: 15450 Da). The signal at 
7527 represents oligo A (calculated: 7521 Da). The signals at 3761 Da 
is the IM + 2HI^* signal of oligo A, whereas the signal at 5140 Da is the 
5 [M + 3HI^* signal of the ligation product. In (b) a spectrum of two pooled 
negative LCRs (without template) is shown. The signal at 7514 Da 
represents oligo A (calculated: 7521 Da). 

FIGURE 34 is a schematic presentation of the oligo base extension 
of the mutation detection primer as described in Example 7, using ddTTP 
10 (A) or ddCTP (B) in the reaction mix, respectively, "me theoretical mass 
calculation is given in parenthesis. The sequence shown is part of the 
exon 10 of the CFTR gene that bears the most common cystic fibrosis 
mutation AF508 and more rare mutations AI507 as well as lle506Ser 
FIGURE 35 Is a MALDI-TOF-MS spectrum recorded directly from 
15 precipitated oligo base extended primers for mutation detection. The 
spectrum in (A) and (B), respectively show the annealed primer (CF508) 
without further extension reaction. Panel C displays the MALDI-TOF 
.^ectrum of the wild type by using pppTdd in the extension reaction and 
D a heterozygotic extension products carrying the 506S mutation when 
20 using pppCdd as tenninator. Panels E and F show a heterozygote with 
AF508 mutation witii pppTdd and pppCdd as terminators in the 
xtension reaction. Panels G and H represent a homozygous AF508 
mutation witi, either pppTdd or pppCdd as terminators. The template of 
diagnosis is pointed out below each spectrum and the obsen^ed/expected 
25 molecular mass are written in parenthesis. 

FIGURE 36 shows the portion of tfie sequence of pRFcl DNA 
which was used as template for nucleic acid amplification in Example 8 
of unmodified and 7-dea2apurin containing 99-mer and 200-mer nucleic 
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acids as well as the sequences of the 19-mer forward primer and the two 
1 8-mer reverse primers. 

FIGURE 37 shows the portion of the nucleotide sequence of 
M13mpl8 RFI DNA, which was used in Example 8 for nucleic acid 
5 amplification of unmodified and 7-deazapurine containing 103-mer 
nucleic acids. Also shown are nucleotide sequences of the 1 7.mer 
primers used in the nucleic acid amplification reaction. 

FIGURE 38 shows the result of a polyacrylamide^el 
electrophoresis of amplified products described in Example 8 purified and 
10 concentrated for MALDI-TOF MS analysis. M: chain lengti, marker, lane 
1: 7-dea2apurine containing 99-mer amplified product, lane 2: unmodified 
99-mer, lane 3: 7-dea2apurine containing 103-mer and lane 4: 
unmodified 103-mer amplified product. 

FIGURE 39: an autoradiogram of polyacrylamide gel 
15 electrophoresis of nucleic acid (Le^ PGR) reactions carried out with 5'- 
£='^PI-labeled primers 1 and 4. Lanes 1 and 2: unmodified and 7- 
deazapurine modified 103-mer amplified product (53321 and 23520 
-counts), lanes 3 and 4: unmodified and 7-dea2apurine modified 200-mer 
(71 123 and 39582 counts) and lanes 5 and 6: unmodified and 7- 
20 deazapurine modified 99-mer (1 7321 6 and 94400 counts). 

FIGURE 40 a) MALDI-TOF mass spectrum of ti,e unmodified 103- 
mer amplified products (sum of twelve single shot spectra). The mean 
value of the masses calculated for the two single strands (31768 u and 
31759 u) is 31763 u. Mass resolution: 18. b) MALDI-TOF mass 
25 spectrum of 7-deazapurine containing 1 03-mer amplified product (sum of 
three single shot spectra). The mean value of the masses calculated for 

the two ingle strands (31727 u and 31719 u) is 31723 u. Mass 
r solution: 67. 
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RGURE 41: a, MALOl-TOF mass specUu,n o, unmodified 99- 
™, ampnfied product (sut, of hventy singte sho, spectra). Values of .he 
masses calculated for the two single strands: 30261 u and 30794 u b) 
MALDI-TOF mass spectrum of 7^eazapurine containing 99-mer amplified 
product (sum of ^velve single shot spectra). Values of the masses 
calculated for the two single strands: 30224 u and 307S0 u 

HSURE 42: a) MALDI-TOF mass spectrum of the unmodified 20O- 
mer amplified product (sum of 30 single sho. spec«a,. .^.e mean value 

u) .s 61 734 u. Mass resolution: 28. b) MALDI-TOF mass spectrum of 7 
d«i«purine containing 200Hner amplified product (sum of 30 single sho, 
spectra). The mean value of the masses calculated for the ttvo single 
strands ,61772 u and 617,4 u) is 6,643 u. Mass resolution- 39 

FIGURE 43: a) MALDI-TOF mass specmim of 7-dea«purine 
containing lOO-mer amplified product with ribomodifled primers The 
.nean value of the masses celculated for the two single stands (30S29 u 
and 31095 u, is 30812 u. b) MALDI-TOF mass spectrum of J 
amplified product after hydrolyflc primer-cleavage. The mean value of 

L ZIV """" "^""^ "'^'^ " 5918 u) 

IS oo77 u. . 

FIGURE 44 A-O shows the MALDI-TOF mass specuum of «,e four 
sequencmg ladders obfalned from a 39-mer template (SEQ ID No 231 
Which was immobilized to strepuvidin beads via a 3" biotinylation a'i4- 

z:z. '° "-^ - — - 

HGURE 45 shows a MALDI-TOF mass spectrum of a solid phase 
s quencing of a 78H^er template (SEQ ID No. 25), which was 
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immobilized to streptavidin beads via a 3' biotinylation. A 1 8-mer primer 
(SEQ ID No. 26) and ddGTP were used in the sequencing. 

FIGURE 46 shows a scheme in which duplex DNA probes with 
smgle-stranded overhang capture specific DNA templates and also serve 
5 as pnmers for solid phase sequencing. 

FIGURE 47 A-D shows MALDI-TOF mass spectra obtained from a 
sequencing reaction using 5' fluorescent labeled 23-mer (SEQ ID No 29) 
annealed to a 3' biotinylated 18-mer (SEQ ID No. 30)^leaving a 5-base 
overhang, which captured a 15-mer template (SEQ ID No. 31) as 
10 described in Example 9. 

FIGURE 48 shows a stacking fluorogram of the same products 
obtained from the reaction described in FIGURE 47, but run on a 
conventional DNA sequencer. 

15 . HH ' MALDI-TOF mass spectrum of the sequencing 

ladder using cycle sequencing as described in Example 1 generated from 
a biological amplified product as template and a 12mer (5'-TGC ACC 
: TGA CTC-3' (SEQ ID NO. 34)) sequencing primer. The peaks resulting 
from depurinations and peaks which are not related to the sequence are 
marked by an asterisk. MALDI-TOF MS measurements were taken on a 
20 reflectronTOFMS. A.) Sequencing ladder stopped with dd ATP- B ) 

Sequencing ladder stopped with ddCTP; C.) Sequencing ladder topped 
with ddGTP; D.) Sequencing ladder stopped with ddTTP. 

FIGURE 50 shows a schematic representation of the sequencing 
ladder generated in Rg. 49 with the corresponding calculated molecular 
25 masses up to 40 bases after the primer. For the calculation, the 

following masses were used: 3581.4Da for the primer, 312 2 Da for 7 
deaza-dATP, 304.2 Da for dTTP. 289.2 Da for dCTP and 328 2 Da for 7- 
deaza-dGTP. 
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FIGURE 51 shows the sequence of the amplified 209 bp amplified 
product within the /?-globin gene, which was used as a template for 
sequencing. The sequences of the appropriate amplification primer and 
the location of the 1 2mer sequencing primer is also shown. This 
sequence represents a homozygote mutant at the position 4 bases after 
the primer. In a wiidtype sequence this T would be replaced by an A. 

FIGURE 52 shows a sequence which is part of the intron 5 of the 
interferon-receptor gene that bears the AluVpA polymorphism as further 
described in Example 11. The scheme presents the primer oligo base 
extension (PROBE) using ddGTP, ddCTP, or both for termination, 
respectively. The polymorphism detection primer (IFN) is underiined, the 
temiination nucleotides are marked in bold letters. The theoretical mass 
values from the alleles found in 28 unrelated individuals and a five 
member family are given in the table. Both second site mutations found 
15 in most 13 units allele, but not all, are indicated. 

FIGURE 53 shows the MALDl-TOF-MS spectra recorded directly 
form precipitated extended cyclePROBE reaction products. Family study 
using AluVpA polymorphism in intron 5 of the interferon-^ receptor gene 
(Example 11). 

20 FIGURE 54 shows the mass spectra from PROBE products using 

ddC as termination nucleotide in the reaction mix. The allele with the 
molecular mass of approximately 1 1 650 da from the DNA of the mother 
and child 2 is a hint to a second site mutation within one of the repeat 
units. 

25 FIGURE 55 shows a schematic presentation of the PROBE method 

for detection of different alleles in the polyT tract at the 3'-end of inti-on 
8 of the CFTR gene with pppCdd as terminator (Example 11). 

FIGURE 56 shows the MALDI-TOF-MS spectra recorded directiy 
from the precipitated ext nded PROBE reaction products. Detection of all 
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three common alleles of the polyT tract at the 3' end of Intron 8 of the 
CFTR gene, (a) T5/T9 heterozygous, (b) T7/T9 heterozygous (Example 

FIGURE 57 shows a mass spectrum of the digestion of a 252-mer 
ApoE gene amplified product (^3/^3 genotype) as described In Example 

12 using a) Cfol alone and b)CfoI plus Rsal. Asterisks: depurinatlon 

peaks. 

FIGURE 58 shows a mass spectrum of the ApoE^ene amplified 
product (.3/.3 genotype) digested by Cfol and purified by a) single and 
10 b) double ethanol/glycogen and c) double isopropyl aicohol/glycogen 
precipitations. 

FIGURE 59 shows a mass spectrum of the Cfol/Rsal digest 
products from a) e2/e3, b) eZfeZ. c) eZ/eA. and d) genotypes. 
Dashed lines are drawn through diagnostic fragments. 
1 5 FIGURE 60 shows a scheme for rapid identification of unknown 

ApoE genotypes following simultaneous digestion of a 252-mer apo E 
gene amplified product by the restriction enzymes Cfol and Rsal. 

FIGURE 61 shows the multiplex (codons 1 1 2 and 1 58) mass 
spectrum PROBE results for a) e2le3. b) e3U3, c) e3/.4, and d) ^4/e4 
genotypes. E: extension products; P: unextended primer. Top: codon 
112 and 158 regions, with polymorphic sites bold and primer sequences 
underlined. 

FIGURE 62 shows a mass spectrum of a TRAP assay to detect 
telomerase activity (Example 1 3). The spectrum shows two of the 
25 pnmer signals of the amplified product TS primer at 5,497.3 Da (calc 
5523 Da) and the biotinylated bioCX primer at 7,537.6 Da (calc 7 537 
Da) and the first telomerase-specific assay product containing thr e 
t lomeric repeats at 12,775.8 Da (calc. 12,452 Da) its mass is larger by 




wo 98/20166 PCT/US97/2a444 

-33- 

one dA nucleotide (12,765 Da) due to extendase activity of Taq DNA 
polymerase. 

FIGURE 63 depicts the higher mass range of FIGURE 62, i.e, the 
peak at 1 2,775.6 Da represents the products with these telomeric 
5 repeats. The peaks at 20,322.1 Da is the result of a telomerase activity 
to form seven telomeric repeats (calc. 20,395 Da including the extension 
by one dA nucleotide). The peaks marked 1, 2, 3 and 4 contain a four 
telomeric repeats at 1 4,674 Da as well as secondary ion product. 

FIGURE 64 displays a MALDI-TOF spectrum of the RT-amplified 
10 product of the human tyrosine hydroxylase mRNA indicating the 

presence of neuroblastoma cells (Example 14). The signal at 18,763,8 
Da represents the non-biotinylated single-stranded 61 mer of the nested 
amplified product (calc. 18,758.2 Da). 

FIGURE 65 (a) shows a schematic representation of a PROBE 
15 reaction for the RET proto-oncogene with a mixture of dATP, dCTP, 
- dGTP, and ddTTP (Example 15). B represents biotin, through which the 
sense template strand is bound through streptavidin to a solid support. 
Rgure 65(b) shows the expected PROBE products for ddT and ddA 
reactions for wildtype, C-^T, and C-^A antisense strands. 
20 FIGURE 66 shows the PROBE product mass spectra for (a) 

negative control, (b) Patient 1 being heterozygote (Wt/C-*T) and (c) 
Patient 2 being heterozygote (Wt/C— A), reporting average values, 

FIGURE 67 shows the MALDI-FTMS spectra for synthetic analogs 
representing ribo-cleaved RET proto-oncogene amplified products from (a) 
25 wildtype, (b) G-^A, and (c) G-*T homozygotes, and (d) wildtype/G-^A, (e) 
wildtype/G-^T, and (f) G->A/G-*T heterozygotes, reporting masses of 
most abundant isotope peaks. 

FIGURE 68 is a schematic representation of nucleic acid 
immobilization via covalent bifunctional trityl linkers. 
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FIGURE 69 is a schematic representation of nucleic acid 
immobilization via hydrophobic trityl linkers. 

^° ^o"^ - MALOI-TOF mass spectrum of a supema«„, 
of the matrix treated Dynabeads containing bound olioo (S- - 
B TCCACCT3ACTC. SEO ,0 WO. aS,. An i^emal ^Z!^ " 
•CTGTGeTCGTGC, SEO ID NO. 57, was included in «,e ma«1x 

of«,emT'*" •'^'S'-™'=™=''"'««™"'o'»S"--na,an, 

TGCA^r'"" """""'"'^ ^--iminobiotin - 

TGCACCTGACTC, SEQ ID NO. S6,. An internal standard 

10 (CTGTGGTCGTGC, SEQ ID NO. 67, was included in the matrix 

FIGURE 72 schematically depicts the steps involved with the Looo 
pnmer oligo base extension (Loop^robe, reaction 

after " ™" °' " « 

after Cfol d-ges, of a stem loop. Hgure 73B-D show MALDI-TOF mass 

IB spectrum of different genotypes: HbA the wildtype genotype ,748, TbC 
a mut«ion of codon 6 of the ^-globin gene which causes^c Je c! ' 
■disease ,74C,, and HbS. a different mutation of codon 6 of the ^-globin 
gene which causes sickle cell disease (740). 

20 «gK,nofCKR-S. The underiined sequence corresponds «,«,e region 
.K,mologous to the amplification primers. n,e dotted region corresponds 
to the 32 bp deletion. H"nas 

FIGURE 75 shows the sense primer ckrT7f. Being desian«rf to 
f^^Ktate binding of T7.RNA polymerase and ampli«ca Jn ^rcK -B 
2B «g.on to be analyzed, n starts with a randomly chosen sequence Tf 
bases, the T7 promoter sequence of 18 bases and the sequence 
homologous to CKfi-5 of 19 bases. 
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FIGURE 76 is a MALDI-TOF mass spectrum of the CKR-5 
amplification product, which was generated as described in the following 
Example 21. 

FIGURE 77 is a positive ion UV-MALDI mass spectra of a synthetic 
5 RNA 25-mer (5'-UCCGGUCUGAUGAGUCCGUGAGGAC-3' SEQ ID 

NO. 62) digested with selected RNAses. For each enzyme 0.6//I aliquots 
of teh 4.5/A assay containing a total of ca, 20 pmol of the RNA were 
fixed with I.S/rf matrix (3-HPA) for analysis. Fragments with retained 5'. 
terminus are marked by different arrows, specific for the different 
10 RNAses, (Hahner et aL, Proceedings of the 44*^ ASMS Conference on 
Mass Spectrometry and Allied Topics, p. 983 (1996)). 

FIGURE 78 is an investigation of the specificity of the RNAses CL3 
and Cusativin by positive ion UV-M ALDI mass spectra of a synthetic RNA 
20-mer. Expected and/or observed cleavage sites are indicated by 
15 arrows. A, B, C indicate correct cleavage sites and corresponding singly 
cleaved fragments. Missing cleavages are designated by a question mark 
(7), unspecific cleavages by an X, 

FIGURE 79 shows the separation of a mixture of DNA molecules 
(12-mer, 6'-biot. 19-mer, 22-mer and 5'-biot. 27-mer) with streptavidin- 
20 coated magnetic beads, a) positive ion UV-MALDI mass spectrum of 
0,6//l of a mixture containing ca. 2-4 pmol of each species mixed with 
1,5//I matrix (3-HPA), b) same as a) but incubation of the mixture with 
magnetic beads and subsequent release of the captured fragments. 
FIGURE 80 Elution of immobilized 5' biotinylated 49 nt in vitro 
25 transcript from tiie streptavidin-coated magnetic beads. Positive UV- 
MALDI mass spectrum of the transcript prior to incubation with the 
magnetic beads (a). Sp ctra of tiie immobilized RNA transcript after 
lution with 95% formamide alone (b) and with various additives such as 
lOmM EDTA (c), lOmM CDTA (d) and 25% ammonium hydroxide (e); 
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EDTA and CDTA were adjusted with 25% ammonium hydroxide to a oH 
of 8. ^ 



Positive UV-MALD, mass spectra Of the 5' biotinylated 
49 nt in yjtro transcript after RNAse U, digest for 1 5 minutes a) 

5 Spectrum Of the 25 ul assay containing ca. 1 00 pmol of the target RNA 
before separation; b) spectrum after isolation of the 5 '-biotinylated 
fragments with magnetic beads. Captured fragments were released by a 
solution of 95o^ formamide containing 10 mM CDTA. .1 ul aliquots of 
the samples were mixed with 1.5 ul matrix (3-HPA) in both cases 

10 '^•^""E 82 schematically depicts detection of putative mutations 

m the human ^lobin gene at codon 5 and 6 and at codon 30, and the 
iVS-1 donor site, respectively, done in parallel. FIGURE 82A shows 
amplification of genomic DNA using the primers >ff2 and ^ff 1 1 The 
location of the primers and identification tags as well as an indication of 
the w,ld type and mutant sequences are shown. FIGURE 82B shows 

ana,y i3 of both sites in a simple Primer Reaction Oligo Base Extension 
(PROBE) using primers ^-TAGI (which binds upstream of codon 5 and 6) 
and ^-TAG2 (which binds upstream of codon 30 and the I VS-1 donor 
srte). Reaction products are captured using streptavidin-coated 
20 paramagnetic particle bound biotinylated capture primers (cap-tag- 1 and 
cap-tag-2, respectively), that have 6 bases at the 5' end that are 
complementary to the 5' end of ^TAGI and ^-TAG2,. respectively, and a 
portion which binds to a universal primer. 

FIGURE 83 shows a mass spectrum of the PROBE products of a 
DNA sample from one individual analyzed as described schematically in 
FIGURE 82. 

FIGURE 84 shows a mass spectrum of the sequence bound to cap- 
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FIGURE 85 shows a mass spectrum obtained by using the fi-TAG 1 
and fi-TAG 2 primers in one sequencing reaction using ddATP for 
termination and then sorting according to the method depicted in FIGURE 
82. 

5 FIGURE 86 shows a mass spectrum obtained by using the fi-TAGl 

and ^-TAG2 primers in one sequencing reaction using ddCTP for 
terminati'on and then sorting according to the method depicted in FIGURE 
82. 

FIGURE 87 A shows the wildtype sequence of a fragment of the 
10 chemokine receptor CKR-5 gene with primers (bold) used for 

amplification. The 32 base pair (bp) deletion in the CKR-5 allele is 
underlined; and the stop nucleoti'des are In italic. In FIGURE 87B, the 
wildtype strands are depicted with and without an added Adenosine, 
their length and molecular masses are indicated. FIGURE 87C indicates 
15 the same for the 32 bp deletion. FIGURE 87D shows the PROBE 
products for the wildtype gene and FIGURE 87E shows the mutated 
allele. 

FIGURE 88 shows the amplification products of different unrelated 
individuals as analyzed by native polyacrylamide gel electrophoreses 

20 (15%) and silver stain. The band corresponding to a wildtype CKR-5 
runs at 75 bp and the band from the gene with the deletion at 43 bp. 
Bands bigger than 75 bp are due to unspecific amplification. 

FIGURE 89A shows a spectrograph of DNA derived from a 
heterozygous individual: the peak with a mass of 2331 9 Da corresponds 

15 to the wildtype CKR-5 and the peaks with masses of 1 31 37 Da and 
13451 Da to the deletion allele with and without an extra Adenosine, 
respectively. FIGURE 89B shows a spectrograph of DNA obtained from 
th am Individual as in FIGURE 89A. but the DNA was treated with T4 
DNA polymerase to remove the added Adenosine. FIGURES 89C and 
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890 are spectrographs derived from homozygous individuals and in 
FIGURE 89D, the Adenosine has been removed. AI. peaks with masses 
lower than 13000 Da are due to multipfe charged molecules. 

FIGURE 90A shows the mass spectrum of the results of a PROBE 
7tOUR^TT °" ^""^ ' Heterozygous individual. 

FIGURE 90B shows a mass spectrum of the results of a PROBE reaction 
on a homozygous individual. The peaks with masses of 6604 Da and 
6607 Da, respectively correspond to the wildtype allele, and the peak 
witi, a mass of 6275 Da to the deletion allele. TUe primer is detected 
lO with a mass of 5673 and 5676 Da, respectively. 

FIGURE 91 shows a MALDI-TOF MS spectra of a thermocycling 
pnmer Oligo Base Extension (tc-PROBE, reaction as described in Example 
24 usmg three different templates and 5 different PROBE primers 
simultaneously in one reaction. 
^5 FIGURE 92 schematically depicts a single tube process for 

amplifying and sequencing exons 5-8 of the p53 gene as described in 
Example 25. The mass spectrum is the A reaction of Figure 93 
■ FIGURE 93 shows a superposition plot of four separate reactions 
for sequencing a portion of axon 7 of the p53 gene as described in 
20 Example 25. 

FIGURE 94 shows the mass spectrum obtained from the A reaction 
for sequencing a portion of axon 7 of the p53 gene as described in 
Example 25. 

"^""'^^ *«'"»=»»P««~"' of a PS3 sequencing ladder 

25 for wh.ch 6nL of each ,eac«on were transferred ,o we«s of a chfp and 
measured by MALDI-TOF. 

FIGURE 96A shows a MALDI-TOF mass spectra of a synth «c 60- 
m r (15.34 kDa) mixed with 27-mer„ (non-c mplementary. 8.30 kDa). 
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u « MALDI-TOF mass spectra of a synthetic 50- 
HGURE96Bshows a MA.^^^^ ,,omp.ementarv. 8.34 kDa). The 

final concentraton of each 9 -.fie dsDNA. 

a3.e8 KOa in Pigure ^e^^^^^^^ ^ 
3 ..CURE 97A ^^o^-^^^^^^ ^ ^ ^^.^^^^^^^^^^^ ^ ,,3 

...gest products of a ^^^^J^^^^^^ ,e. 
genotype) , using sample ^^^^^^ 3amp.es 
FIGURE 97B is the same as ngu 

FIGURE 98 shows . MAL^™ ^^^^ 

...uHaoeous,, double ^ 3a,nples prepared « 

4 of the apoHpoprotein E gene (e4 genotyp 

^ ... ™e«ra obatined on a small population 
loF <)9 shows the mass spectra ouo 
OGURE 99 ^.^g^^^^^ p^^d^^,3 

15 study of 15 patients w.th a 16 ^crodispenser. 

..ansferred to a ta.- ^^^^^^^^^ _,3d after 

FIGURE 100 is a MALDl mass 5P= 

a T. digest of a synthetic 20-merRNA. 

DEiAllfiO DESCRIPTION OF THE INVEMTION 
20 Definitions „ ,,,hnical and scientific temts used 

unless defned „„,,,3,„„, ^ <,„e of skill 

Heteln have - -j:^ Jlete penn^ted the suhiec. 

« .etein inco^ed ^ .mple" tefets to anv mate.- 

r „ «Mnoe human, animal, plant, bactena. 
oMained ft m ;v"^^;^*-^,,,„. ..ological sample will 
rc;rr.rair:: — . E.amp„s ». appt„p.ate 
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biologlcal samples include, but are not limited to: solid materials 
tissue, cell pellets, biopsies) and biological fluids (e^ urine, blood ' 
sal.va, amniotic fluid, mouth wash, cerebral spinal fluid and other bodv 
fluids). ^ 

5 As used herein, the phrases 'chain-elongating nucleotides' and 

Cham-terminating nucleotides' are used in accordance with their art 
recognized meaning. For example, for DNA, chain-elongating nucleotides 
include 2'deoxyribonucleotides (e^ dATP, dCTP, dGTP and dTTP) and 
cham-temiinating nucleotides include 2', 3'-dideoxyribonucleotides /e a 
10 ddATP,ddCTP,ddGTP,ddTTP). For RNA, chain-elongating nucleoti^' 
•nclude ribonucleotides fe^ ATJP, CTP, GTP and UTP) and chain- 
terminating nucleotides include 3'-deoxyribonucleotides (e^ 3'dA 
3'dC, 3'dG and 3'dU). A complete set of chain elongating nucleotides 
refers to dATP, dCTP, dGTP and dTTP. The teon 'nucleotide' is also 
15 well known In the art. 

As used herein, nucleotides include nucleoside mono- di- and 
tiiphosphates. Nucleotides also include modified nucleotides such as 
Phosphorotiiioate nucleotides and deazapurine nucleotides. A complete 
set of chain-elongating nucleotides refers to four different nucleotides 
that can hybridize to each of ti,e four different bases comprising tiie DNA 
template. 

As used herein, the superscript O-i designates i + 1 mass 
differentiated nucleotides, primers or tags. In some instances, the 
superscript O can designate an unmodified species of a particular 
25 reactant, and the superscript i can designate the i-th mass-modified 
species of that reactant. If, for example, more than one species of 
nucleic acids are to be concuaentiy detected, tiien i + 1 different 
mass-modified detector oligonucleotides (D , D', . . . o*) can be used to 
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distinguish each species of mass modified detector oligonucleotides (D) 
from the others by mass. spectrometry. 

As used herein, "multiplexing" refers to the simultaneously 
detection of more than one analyte, such as more than one (mutated) loci 
5 on a particular captured nucleic acid fragment (on one spot of ah array). 
As used herein, the term "nucleic acid" refers to single-stranded 
and/or double-stranded polynucleotides such as deoxyribonucleic acid 
(DNA), and ribonucleic acid (RNA) as well as analogs or-derh/'atives of 
either RNA or DNA. Also included in the term 'nucleic acid' are analogs 
10 of nucleic acids such as peptide nucleic acid (PNA), phosphorothioate 
DNA, and other such analogs and derivatives. 

As used herein, the term "conjugated" refers stable attachment, 
preferably ionic or covalent attachment. Among preferred conjugation 
means are: streptavidin- or avidin- to biotin interaction; hydrophobic 
IS interaction; magnetic interaction (e.g.. using functionalized magnetic 
. beads, such as DYNABEADS, which are streptavidin-coated magnetic 
beads sold by Oynal, Inc. Great Neck, NY and Oslo Norway); polar 
interactions, such as "wetting" associations between two polar surfaces 
or between oligo/polyethylene glycol; formation of a covalent bond, such 
20 as an amide bond, disulfide bond, thioether bond, or via crosslinking 
agents; and via an acid-labile or photocleavable linker: 

As used herein equivalent, when referring to two sequences of 
nucleic acids means that the two sequences in question encode the same 
sequence of amino acids or equivalent- proteins. When "equivalent" is 
25 used in referring to two proteins or peptides, it means that the two 
proteins or peptides have substantially the same amino acid sequence 
with only conservative amino acid substitutions that do not substantially 
alter the activity r function of the protein or peptide. Wh n 
"equivalent" r fers to a property, the property does not need to be 
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present to the same extent [e^., two peptides can exhibit different rates 
of the same type of enzymatic activity], but the activities are preferably 
substantially the same. "Complementary/ when referring to two 
nucleotide sequences, means that the two sequences of nucleotides are 
5 capable of hybridizing, preferably with less than 25%, more preferably 
with less than 15%, even more preferably with less than 5%, most 
preferably with no mismatches between opposed nucleotides. Preferably 
the two molecules will hybridize under conditions of high stringency. 
As used herein: stringency of hybridization in determining 
10 percentage mismatch are those conditions understood by those of skill in 
the art and typically are substantially equivalent to the following: 

1) high stringency: 0,1 x SSPE, 0.1% SDS, 65<*C 

2) medium stringency: 0.2 x SSPE, 0,1 % SDS, 50«C 

3) low stringency: 1 .0 x SSPE, 0.1 % SDS, 50^C 

15 It is understood that equivalent stringencies may be achieved using 
alternative buffers, salts and temperatures. 

As used herein, a primer when set forth in the claims refers to a 
primer suitable for mass spectrometric methods requiring immobilizing, 
hybridizing, strand displacement, sequencing mass spectrometry refers to 

20 a nucleic acid must be of low enough mass, typically about 70 

nucleotides or less than 70, and of sufficient size to be useful in the 
mass spectrometric metiiods described herein that rely on mass 
spectrometric detection. These methods include primers for detection 
and seequening of nucleic acids, which require a sufficient number 

25 nucleotides to from a stable duplex, typically about 6-30, preferably 

about 10-25, more preferably about 12-20, Thus, for purposes herein a 
primer will be a sequence of nucleotides comprising about 6-70, more 
pref rably a 1 2-70, more preferably greater than about 1 4 to an upper 
limit of 70, depending upon sequence and application of the primer. The 
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primers herein, for example for mutational analyses, are selected to be 
upstream of loci useful for diagnosis such that when performing using 
sequencing up to or through the site of interest, the resulting fragment is 
of a mass that sufficient and not too large to be detected by mass 
5 spectrometry. For mass spectrometric methods, mass tags or modifier 
are preferably included at the 5'-end, and the primer is otherwise 
unlabeled. 

As used herein, -conditioning" of a nucleic acid refers to 
modification of the phosphodiester backbone of the nucleic acid molecule 
10 (e^, cation exchange) for the purpose of eliminating peak broadening 
due to a heterogeneity in the cations bound per nucleotide unit. 
Contacting a nucleic acid molecule with an alkylating agent such as 
akyliodide, iodoacetamide, )9-iodoethanol, or 2,3-epoxy-1-propanol, the 
monothio phosphodiester bonds of a nucleic acid molecule can be 
15 transformed into a phosphotriester bond. Likewise, phosphodiester 
-bonds may be transformed to uncharged derivatives employing 
Irialkylsilyl chlorides. Further conditioning involves incorporating 
nucleotides that reduce sensitivity for depurination (fragmentation during 
MS) e^ a purine analog such as N7- or N9-deazapurine nucleotides, or 
20 RNA building blocks or using oligonucleotide triesters or incorporating 
phosphorothioate functions that are alkylated or employing 
oligonucleotide mimetics such as peptide nucleic acid (PNA). 

As used herein, substrate refers to an insoluble support onto 
which a sample is deposited according to the materials described herein. 
25 Examples of appropriate substrates include beads (e.g., silica gel, 

controlled pore glass, magnetic, agaroase gele and crosslinked dextroses 
(k^ Sepharose and Sephad x, cellulose and other materials known by 
those of skill in the art to serve as solid support matrices. 
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For examples substrates may be formed from any or combitions of: silica 
gel, glass, magnet, polystyrene/1 % divinylbenzene resins, such as Wang 
resins, which are Fmoc-amino acid-4-(hydroxymethyl)phenoxymethyl- 
copoly(styrene-1% divinylbenzene (DVD)) resin, chlorotrityl 
5 (2-chlorotritylchloride copolystyrene-DVB resin) resin, Merrifield 
(chloromethylated copolystyrene-DVB) resin metal, plastic, cellulose, 
cross-linked dextrans, such as those sold under the tradename Sephadex 
(Pharmacia) and agarose gel, such as gels sold under the tradename 
Sepharose (Pharmacia), which is a hydrogen bonded polysaccharide-type 

10 agarose gel, and other such resins and solid phase supports known to 
those of skill in the art. The support matrices may be in any shape or 
form, including, but not limited to: capillaries, flat supports such as glass 
fiber filters, glass surfaces, metal surfaces (steel, gold, silver, aluminum, 
copper and silicon), plastic materials including multiwell plates or 

15 membranes (e.g., of polyethylene, polypropylene, polyamide, 
polyvinylidenedifluoride), pins (e-g., arrays of pins suitable for 
combinatorial synthesis or analysis or beads in pits of flat surfaces such 
as wafers (e.g., silicon wafers) with or without plates, and beads. 

As used herein, a selectively cleavable linker is a linker that is 

20 cleaved under selected conditions, such as a photocleavable linker, a 
chemically cleavable linker and an enzymatically cleavable linker (i.e., a 
restriction endonuclease site or a ribonucleotide/RNase digestion). The 
linker is interposed between the support and immobilized DNA. 
Isolation of nucleic acids molecules 

25 Nucleic acid molecules can be isolated from a particular biological 

sample using any of a number of procedures, which are well-known in 
the art, the particular isolation procedure chosen being appropriate for 
th particular biological sample. For example, freeze-thaw and alkaline 
lysis procedures can be useful for obtaining nucleic acid molecul s from 
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solid materials; heat and alkaline lysis procedures can be useful for 
obtaining nucleic acid molecules from urine; and proteinase K extraction 
can be used to obtain nucleic acid from blood (see, e.o.. Rolff et al 
(1994) PGR: Clinical Diagnostics and Research, Springer). 
5 To obtain an appropriate quantity of a nucleic acid molecules on 

which to perform mass spectrometry, amplification may be necessary. 
Examples of appropriate amplification procedures for use herein 
include: cloning (Sambrook si aL, Molecular Cloning; A-Lahnrfttftfy 
MaouaL Cold Spring Harbor Laboratory Press, 1989), polymerase chain 
10 reaction (PCR) (C.R. Newton and A. Graham, PCR, BIOS Publishers, 
1 994), iigase chain reaction (LCR) (see, e.g. . Weidmann et SL (1 994) 
PCR Methods AppI Vol. 3, Pp. 57-64; F. Barany (1991) Proc. Natl 
Acad. Spi. U.S.A. g8: 189-93), strand displacement amplification (SDA) 
(see, .e.g.. Walker et aL d 994) Nucleic Acids Ras 22:2670-77) and 
1 5 variations such as RT-PCR (see, e.o.. Higuchi et gL (1 993) 

.Bio/Technology JJ.: 1 026-1 030), allele-specific amplification (ASA) and 
transcription based processes. 

Immobilization of nucleic acid molecules to solid supports 
To facilitate mass spectrometric analysis, a nucleic acid molecule 
20 containing a nucleic acid sequence to be detected can be immobilized to 
an insoluble (i.e., a solid) support. Examples of appropriate solid 
supports include beads (e.g., silica gel, controlled pore glass, magnetic. 
Sephadex/Sepharose, ceHulose), capillaries, flat supports such as glass 
fiber filters, glass surfaces, metal surfaces (steel, gold, silver, aluminum, 
25 copper and silicon), plastic materials including multiwell plates or 
membranes (e.g.. of polyethylene, polypropylene, polyamide, 
poiyvinylidenedifluoride), pins (e.g. . arrays of pins suitable for 
c mbinatorial synthesis r analysis or beads in pits of flat surfaces such 
as wafers (e^, silicon wafers) with or without filter plates. 
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Samples containing target nucleic acids can be transferred to solid 
supports by any of a variety of methods known to those of skill in the 
art. For example, nucleic acid samples can be transferred to individual 
wells of a substrate, e.g.. silicon chip, manually or using a pintool 
5 microdispenser apparatus as described herein. Alternatively, a 

piezoelectric pipette apparatus can be used to transfer small nanpltter 
samples to a substrate permitting the performance of high throughput 
miniaturized diagnostics on a chip. 

Immobilization can be accomplished, for example, based on 

10 hybridization between a capture nucleic acid sequence, which has 

already been immobilized to the support and a complementary nucleic 
acid sequence, which is also contained witiiin the nucleic acid molecule 
containing the nucleic acid sequence to be detected (FIGURE 1 A). So 
that hybridization between the complementary nucleic acid molecules is 

15 not hindered by the support, the capture nucleic acid can include an e.g. . 
spacer region of at least about five nucleotides in length between the 
. solid support and the capture nucleic acid sequence. The duplex formed 
will be cleaved under the influence of the laser pulse and desorption can 
be initiated. The solid support-bound nucleic acid molecule can be 

20 presented through natural oligoribo- or oligodeoxyribonucleotide as well 
as analogs (e.g., thio-modified phosphodiester or phoisphotriester 
backbone) or employing oligonucleotide mimetics such as PNA analogs 
(see, e.g., Nielsen eta/.. Science 254 :1497 (1991)) which render the 
base sequence less susceptible to enzymatic degradation and -bound 

25 capture base sequence. 
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Linkers 

A target detection site can be directly linked to a solid support via 
a reversible or Irreversible bond between an appropriate functionality (L') 
on the target nucleic acid molecule (T) and an appropriate functionality 
5 (U on the capture molecule (FIGURE 1 B). A reversible linkage can be 
such that It is cleaved under the conditions of mass spectrometry (i.e., a 
photocleavable bond such as a charge transfer complex or a labile bond 
being formed between relatively stable organic radicals): 

Photocleavable linkers are linkers that are cleaved upon exposure 
10 to light (see, e^, Goldmacher et aL (1992) BiocorjL_eheau 3. 104-107). 
thereby releasing the targeted agent upon exposure to light. 
Photocleavable linkers that are cleaved upon exposure to light are known 
(see, e^ Hazum sj aL (1981) in Peot.. Prnn P.. r. p^nt Qy -p , Ifith 
Brunfeldt. K (Ed), pp. 105-1 10, which describes the use of a nitrobenzyl 
1 5 group as a photocleavable protective group for cysteine; Yen ej aL 
(1989) Makromol. Chem 19Q:69-82, which describes water soluble 
photocleavable copolymers, including hydroxypropylmethacrylamide 
copolymer, glycine copolymer, fluorescein copolymer and 
methyfrhodamine copolymer; Goldmacher gj aL (1992) Bioconi. nh^n, 
20 2:104-107. which describes a cross-linker and reagent that undergoes 
photolytic degradation upon exposure to near UV light (350 nm); and 
Senter alaL (1985) Photochem. Photohiol 42:231-237, which describes 
nitrobenzyloxycarbonyl chloride cross linking reagents that produce 
photocleavable linkages), thereby releasing the targeted agent upon 
25 exposure to light. In preferred embodiments, the nucleic acid is 

ImmoWnzed using the photocleavable linker moiety that is cleav d during 
mass spectrometry. Pr sently pr f n- d photocleavable link rs are set 
forth in the EXAMPLES. 
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Furthermore, the linkage can be formed with I - k • 

~ . ...... 

-PPor, cmes negative charges which rape, «,e nagatfveV rha' T 
, nucelc acia haCbcna and «,„3 facKtee «.c dascpL 127, 
B a„a,va.sh.a™ssspec.r„™,a. Daso,p«„„ can rcJ: ^ 
hea, created by *a laser pube and/or. dapendino on L 1 
-so,p«„„ „, ,^ resona Je"::!":/"^""^ 
chromophore. 

Thus, the L-L' chemistry can be of a type of di..^if w u 
10 ,che.ica..v Ceavab-e. ,or e.a.p,e. h. .ercrolarrdl"" 
a b«o«n/a.ep«v,d,n ayste™. a he.eroh,h,„c«c„a, T 
e*er group ,sae, ^ KaaterstaL (1990) -A JZ T ° ' ""^ 
for Modificaticn of Synthetic Bi^ ■ . A^d-Labile Unker 

iL-'035) that can be cleaved under mlldiv .^i,. Z^. 
- cond^lona o, „aaa apec.ron»:::ru::; X^^^^^^^^^ ^ 

a-moa. neutral conditions with a hydrazlniunVacetat! bu«e al ' 
arg,n.na.arainlne or lysine-lysine bond cleavable h„ 7 
^ .i.e trypsin or a pyrophosphate ZIZZZT::;: 

e,^. wh,ch can be cleaved, for exampte. by a ribonuclease or alKali 

con,p.e^a^rTT' ' ^ = 

cases I - H -"V 
cases «,e cherge-transfer band- can be determined by Uv/vis 

.pec.ron,e.ry,sae.^S,g,,,i^^,,,g^,j^^^ 
Foster. Academic Press. 

corresponding energy o, the charge-transfer wavelength ^nd ^ 
specific deson,«on of, th solid support can be initiator " 
•he art will recooni,. «... . oe initiated. Those skilled in 

will recogn«e the, s v ral combinations can serve this purpose 
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and that the donor functionality can be either on the solid support or 
coupled to the nucleic acid molecule to be detected or vice versa. 

In yet another approach, a reversible L-L' linkage can be generated 
by homolytically forming relatively stable radicals. Under the influence of 
5 the laser pulse, desorption (as discussed above) as well as ionization will 
take place at the radical position. Those skilled in the art will recognize 
that other organic radicals can be selected and that, in relation to the 
dissociation energies needed to homolytically cleave the bond between 
them, a corresponding laser wavelength can be selected (see e.g.. 
Reactive Molecules by C. Wentrup, John Wiley & Sons, 1984). 

An anchoring function L' can also be incorporated into a target 
capturing sequence (TCS) by using appropriate primers during an 
amplification procedure, such as PGR (FIGURE 4), LCR (FIGURE 5) or 
transcription amplification (FIGURE 6A). 

When performing exonuclease sequencing using MALDI-TOF MS, a 
single stranded DNA molecule immobilized via its 5-end to a solid support 
is unilaterally degraded with a 3'-processive exonuclease and the 
molecular weight of the degraded nucleotide is determined sequentially. 
Reverse Sanger sequencing reveals the nucleotide sequence of the 
immobilized DNA. By adding a selectively cleavable linker, not only can 
the mass of the free nucleotides be determined but also, upon removal of 
the nucleotides by washing, the mass of the remaining fragment can be 
detected by MALDI-TOF upon cleaving the DNA from the solid support. 
Using selectively cleavable linkers, such as the photocleavable and 
chemical cleavable linkers provided herein, this cleavage can be selected 
to occur during the ionization and volatizing steps of MALDI-TOF. 
The sam rationale applies for a 5' immobilized strand of a double 
stranded DNA that is degraded whil in a duplex. Likewise, this also 



wo 98/20166 



PCr/US97/20444 



-50- 



applies when using a 5'-processive exonuclease and the DNA is 
immobilized through the 3'-end to the solid support. 

As noted, at least three version of immobilization «ro 

B sequence or ^noun^„, 3e,uence „us, be ^^Z,^ 

*rouaH .„,cage ,o one predetenn,„a<. suan.. «« p,,^ 
e.,m,„=,e *e duptex and ««„ a high concentration „, a con,nlt 
PHn^r or O.. ^ .en«. „p«rean, .o. .e .r,e, irT^^ . 
strand d.splacen«n, occurs and .he pd„,er is hybddized to tf,e 

fmniobilized strand. 

on Th" '""^'^"'^ *^ P^-- -Ceic acid is in,n«bi,ized 

on so-d support and the target nuCeic acid is hybridized thereto 

a. r; T T""' """"" """^ - 

a, *e 5 .„d so «,a, free 3--OH is avaiiable ,or nucleic acid syn«.esis 

<exte„s„n, and the sepuence o, the -hybridized- targe. DNA carr 

deter^ned because *e hybridized .en,pia.e can be ren,oyed by 
dena«,raaon and *e extended DNA products cteaved from the solid 
support ,or MAU.-TOP .S. Similarly ,„r 3,. «.e Immobilized ^rA strand 

an .mmob,l,zed pnmer of a known, upsfreamn DNA sequence 

complementary to an Invariable region of a target seouence Tl,. . • 
add from the person is obtained and the DNA seguel oTa v^Z 
' 9..n .deletion, insertion, miss nse n,uta«on «,at cause genel 
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predisposition or diseases, or the presence of virai/bacterial or fungal 
DNA) not only is detected, but the actual sequence and position of the 
mutation is also determined. 

In other cases, the target DNA must be immobilized and the primer 
5 annealed. This requires amplifying a larger DNA based on known 
sequence and then sequencing the immobilized fragments (i.e., the 
extended fragments are hybridized but not immobilized to the support as 
described above). In these cases, it is not desirable tolnclude a linker 
because the MALOI-TOF spectrum is of the hybridized DNA; It is not 
10 necessary to cleave the immobilized template. 

Any linker known to those of skill In the art for immobilizing 
nucleic acids to solid supports may be used herein to link the nucleic acid 
to a solid support. The preferred linkers herein are the selectively 
cleavable linkers, particularly those exemplified herein. Other linkers 
include, acid cleavable linkers, such as bismaleimideothoxy propane, 
acid-labile trityl linkers. 

Acid cleavable linkers, photocleavable and heat sensitive linkers 
may also be used, particulariy where it may be necessary to cleave the 
targeted agent to permit it to be more readily accessible to reaction. 
Add cleavable finkers Include, but are not limited to, bismaleimideothoxy 
propane; and adipic acid dihydrazide linkers (see, e.g.. Fattom et aK 
<1 992) Infection & Immun. ^:584-589) and acid labHe transferrin 
conjugates that contain a sufficient portion of transferrin to pemiit entry 
into the intracellular transferrin cycling pathway (see, e.g.. Welhoner gt 
25 aL (1 99 1) J. Biol. Chem. 266:4309-43 1 4) . 



15 



20 
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Photodeavable Linkers 



10 



IS 



Photocleavable linkers ere provided. I„ parteuier, photocleavabl. 

n ers as «,e,r pHosp.„ran.id,.e deriva^Ves are provided L ^ZT. 

Phase syndesis o, o«8onuc,eo«des. The linker. oon»ln o-nitrob^nr 

J.e«es and pHospHate linkages wbich allo» ,or oo.plete pholi^^ 

cleavage o, «,e conjugates wl*in .inutes upon UV lrradia.L Te uv 

wavelenB«,s used are selected so .hat the irradiation wil. no, d 

oligonucleotides and are preferrabiy about 350-380 nl ' 

365 nn,. rUe photoCeavabie iinkeL pro Jed hrrro; 
cousin, e„.,„.,__^.„_^;;~ 

™~ (aee. Sinha « ^ ,,933, 1,,,,!,,,,^ 2^=6843-5846. 

<1993, 1,,,,!^^,e,23.6,94; and Metteucci^^ ,,98,, 
^2be!rL_Soc, 103:3185-3191). >> ^L-aHL, 

In one embodiment, the photocleavable linkers have formula I: 



20 



25 




(1) 



, " "^'^-^'^'-^'^xVa-M or ..hydroxvalkvl; is 
Z7y. R^" : «^o-<Vcarbonv. and 

« IS alkyi, alkoxy, aryl or aryloxy. 

for.u.r,l! ""'"^ '"^ -"•-'--'"^ have 
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5 



10 




where R*" is <(;-(4,4'-dimethoxytrityloxy)alkyl, «/-hydroxyalkyl or alkyi; R*' 
is selected from hydrogen, alkyi, aryl, alkoxycarbonyi, aryloxycarbonyl 
15 and carboxy; R" is hydrogen'or (c^ialkylamino)(<c^<;yanoalkoxy}P-; and X**' 
is hydrogen, alkyi or OR**. 

In particularly preferred embodiments, R^ is 3-(4,4'- 
dimethoxytrityloxy)propyl, 3-hydroxypropyI or methyl; R^^ is selected 
from hydrogen, methyl and carboxy; R" is hydrogen or 
20 .(diisopropyIamino)(2-cyanoethoxy)P-; and X«> is hydrogen, methyl or 
OR**. In a more preferred embodiment, R^° is 3-(4,4'- 
dimethoxytrityloxy)propyl; R^' is methyl; R« is (diisopropylamino)(2- 
cyanoethoxy)P-; and X** is hydrogen. In another more preferred 
mbodiment, R** is methyl; R'^ is rhethyl; R*^ is {diisopropylamino)(2- 
25 cyanoethoxy)P-; and X*> is 3-(4,4'-dlmethoxyfrityloxy)propoxy. 

In another embodiment, the photocleavable linkers have formula 

III: 

30 



35 
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OR 



(III) 

23 



Where R- is hydrogen or (d.alkv,amino)«.,-oye„oa.k„xy,p:; 
detected fron. a^y.ro^^,.^. ^-i*A-^,n^^o.y.Zo:^y>ll I 
IS hydroxyelM and 6M4,4'^ime«,oxytrityloxylalkvl a J- 

suhad.^^ on d.e .,M or a.o.v rTw^ orto!: XTr" ' 
and a are each l„depende«,y CM; and R» e,M. a.^oZ^ r ^ ' 
aryoxy. In cer«in embodin»n.a, i. <.hydroxya.M or J4 4- 

30 rc:rr°~ --^ - - -v. cha. « 

In preferred embodiments. R« is hydrogen or (diia„propy,aminoK2 

' ' " ^-''V*<»<VPropoxy, S-i^T 

d.m«ho^„xy,propoxy. 4-hydroxybu,y,. 3-hydroxy- , -propy, 1 
hydroxy-a-propw. 3*ydroxy-2.„e«,W-,.propy,. a-hydroxy^ J 
25 •'ydroxymethyl,4^4,4-^imethoxy«,y,oxy)b«y| 3-(4 4- 

d.mett,oxy«,y,oxy,.,^,ropy,. 2.,4,4'^ime*oxyM.y,o;;v,e«,w ,-,44. 
d,n,e«,oxy«ty.oxy«.propy,. 3-,4.4-^ime«,oxy.iWoxW-2.meCM 
propyl and 4,4--dlmethyoxyo-ityloxyme*yl. . ^ ' 

In more preferred embodiments, H'^ is (diisopropylaminol(2- 
'0 JanoethoxylP-; rands are O; and R" is selected from 3-,4 4 

d«»*oxy.HtWoxy,propoxy. 4-,4.4--dimed,oxytH^„xv,bu.y;. 3. ,4 4-- 
*m .hoxytntylcxytpropyl. 2-(4.4-^ime.hoxy.ri.y|„xy,e.hy. ,.,4 4 
<..n.e.hoxyt,i,Woxy.2.ropy,. 3-,4.4-.ime«.oxy..y,:x,,;^ 
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propyl «K. 4.4-.dime*Voxv«Hv.oxvn,e*vl. R» >s most pcaferably 3- 
(4 4'-dimethoxytrityloxy)propoxy. 

Preparation of the photocleavable Hnkers 

A. Preparation of Photodeavable linkers of 
formulae I or II 

Pho«.c.eav,ble Unkers of fonnulae . or II may be prepared by *e 
„,e*ods described below, by n*»r .nodHica.on *<' "»«^»^ 
Choosing the appropriate starting n«tenals or by any other me«,ods 
Lown I those o, s«.. in the art. Detailed procedures tor the sv.thes,s 

10 of photocleavable Unkers of fom.ula II are provided in *= JxarM-- 

,„ the photocleavable We-s of fonnulall Where X».s hydrogen. 

the imlcers n«v be prepared in the following manner AIMation of 5- 
hydroxy-Z-nitrobcnzaldehyde ^ «- u^hydroxyalM hal,de 3- 
hydroxlpropyl bromide, followed by protec«on of «.e resulfng alcohol 
15 as ea„ a siM ether, provides a 6-«-silyl=xyalkoxy)-2- 

::„o^nza«Iyde. Addison of an organomeUllic to the aldehyde affords 
a benzylic alcohol. Organometallics which may be used .ndude 
trtallcylaluminums Ifor linlcers where R" is aM. as ^ 
trlmethylaluminum, borohydHdes (for linkers where ,s hydrogen,, such 
20 as sodium borohydnde. or metal cyanides (for linkers where R -s 

carboxy or alkoxycarbonyl.. such as potassium cyamde. In the case o, 
the metal cyank.es. ^ pr«.oct o, «.e re,c«on. a wouM^ 
then be hydrdyzed under either acWic or basic oond,t.ons ,n the presence 

i^-^K«i tn afford the compounds of interest, 
of either water or an alcohol to atioro m k 

25 The silyl grcup of «te side chain o. the resulting benzyUc alcohols 

.«y then be exchanged for a 4.4--dime«.oxytriyl group by desilylation 

with. ^ tetrabutylammonium fluoride, to give the corresponding 

ale hol. followed by reac«on W«h 4.4-^ime,hoxytr,.yl chlonde. Reacfon 
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With, ^ 2-cyanoethyl diisopropylchlorophosphoramidite afford 
linkers where R« io i - -omiaite affords the 

s wmere R is (<lralkylamino)(4M:yanoalkoxy|P- 
A specific example of a synthesis of a photocieavahlo r . 
-fa^uia ,s Shown ,n *e foiiowin. schen«. Ich lTI 
S "-Of the (inker in Oligonucleotide ay„«,esis Thl. r 

illus^ative only and l„ no way ,i.L Tsoo^^T"' '"'^""^ 
Experin,en.al deteils of *ese syndetic ,ra„^ 

«.e Exanvies. •'^''^ormanons are provided in 
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HO 




NO 



cat. KI 



CHO 
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NO^ fmidazole 



CHO 
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Synthesis of the linkers of formula II where X^" is OR«>. 3,4- 
dihydroxyacetophenone is protected selectively at the 4-hydroxyl by 
20 reaction with, e^ potassium carbonate and a silyl chloride. Benzoate 
esteres, propiophenones, butyrophenones, etc. may be used in place of 
the acetophenone. The resulting 4-silyloxy-3-hydroxyacetophenone is 
then alkylated at the with an alkyi halide (for linkers where is alkyi) at 
tiie 3-hydroxyl and desilylated with, e^ tetrabuylammonium fluoride to 
25 afford a 3-alkoxy-4-hydroxyacetophenone. This compound is then 
alkylated at the 4-hydroxyl by reaction with an a»-hydroxyalkyl halide, 

3-hydroxypropyl bromide, to give a 4-(a,-hydroxyalkoxy)-3- 
alkoxyacetophenone. The side chain alcohol is then protected as an 
ester, an acetate. This compound is then nitrated at the 5-position 
30 with, e^ concentrated nitric acid to provide the corresponding 2- 
nitroacetophenones. Saponification of the side chain ester with, e^ 
potassium carbonate, and reduction of the ketone with. e^. sodium 
borohydride, in either order gives a 2-nitro-4-(a;-hydroxyalkoxy)-5- 
alkoxyb nzyjic alcohol. 
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Selective protection of the side chain alcohol as the .r. 

d.methoxytrity| chloride. Further reaction with e a 9 . 

witn, e^, 2-cyanoethvl 
dusopropyfchlorophosphoramidite affords the linkers where R« ; 
(dialkylamino)(a/-cyanoalkoxy)P-. 

,„ .T""" «" = Photocleavable linker o, 

formuta II b shown «,e following scheme. TOs schen» - ■ 

«.u..3«.e ... en. in no wev .he scope ITZ '^ZZlT 

OH 



HO 




AcO 
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B. Preparation of photodeavable linkers of 
formula III 

Photodeavable linkers of formula ill may be prepared by the 
methods described below, by minor modification of the methods by 
5 choosing appropriate starting materials, or by other methods known to 

those of skill in the art. 

In general, photodeavable linkers of formula III are prepared from 
6(/-hydroxyalkyl- or alkoxyaryl compounds, in particular ft;-hydroxy-alkyl or 
alkoxy-benzenes. These compounds are commercially available, or may 
10 be prepared from an w-hydroxyalkyi halide (e^ 3-hydroxy propyl 

bromide) and either phenyllithium <for the a/-hydroxyalkylbenzenes) or 
phenol <for the oA-hydroxyalkoxybenzenes). Acylation of the w-hydroxyl 
group (e.g.. as an acetate ester) followed by Friedel-Crafts acylation of 
the aromatic ring with 2-nitrobenzoyl chloride provides a 4-(a/-acetoxy- 
15 alkyi or alkoxy)-2-nitrobenzophenone. Reduction of the ketone with, 
e.g.. sodium borohydride, and saponification of the side chain ester are 
performed in eitiier order to afford a 2-nitrophenyl-4-(hydroxy-alkyl or 
alkoxy)phenylmethanol. Protection of the terminal hydroxyl group as the 
: corresponding 4,4'-dimethoxytrityl ether is achieved by reaction with 
20 4,4'-dimethoxytrityl chloride. The benzylic hydroxyl group is then 
reacted with, e.g.. 2-cyanoethyl diisopropylchlorophosphoramidite to 
afford linkers of fomiula II where R^^ is (dialkylamino){ft/-cyanoalkoxy)P-. 

Other photodeavable linkers of formula III may be prepared by 
substituting 2-phenyl-1-propano! or 2-phenylmethyl-1-propanol for the w- 
25 hydroxy-alkyi or alkoxy-benzenes in the above synthesis. These 

compounds are commercially available, but may also be prepared by 
reaction of, e.g.. phenylmagnesium bromide or benzylmagnesium 
bromide, with the requisit oxirane (Le^ propylene oxide) in the presence 
of catalytic cuprous ion. 
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Chemically deavable linkers 
A variety of chemically deavable linkers may be used to introduce 
a deavable bond between the immobilized nucleic acid and the solid 
support. Acid-labile linkers are presently preferred chemically deavable 
5 linkers for mass spectrometry, especially MALDI-TOF MS, because the 
add labile bond is cleaved during conditioning of the nucleic acid upon 
addition of the 3-HPA matrix solution. The acid labile bond can be 
introduced as a separate linker group, e.g.. the acid labile trityl groups 
(see Rgure 68; Example 16) or may be incorporated in a syntfietic 
10 nucleic add linker by introducing one or more silyl Internudeoside bridges 
using diisopropylsilyl, thereby forming diisopropyfsilyl-linked 
oligonucleotide analogs. The diisopropylsilyl bridge replaces the 
phoshodiester bond in the DNA backbone and under mildly acidic 
conditions, such as 1.5% trifluoroacetic acid (TFA) or 3-HPA/1 % TFA 
15 MALDI-TOF matrix solution, results in the introduction of one or more 
intra-strand breaks In the DNA molecule. Methods for the preparation of 
diisopropylsilyl-linked oligonucleotide precursors and analogs are known 
to those of skill in the art (see e^, Saha et aL (1 993) J. Org. Cham 
§2:7827-7831). These oligonucleotide analogs may be readily prepared 
20 using solid state oligonucleotide synthesis methods using diisopropylsilyl 
derivatized deoxyribonucleosides. 
Nudeic add conditioning 

Prior to mass spectrometric analysis, it may be useful to 
"condition" nucleic add molecules, for example to decrease the laser 
25 energy required for volatilization and/or to minimize fragmentation. 
Conditioning is preferably performed while a target detection she is 
immobilized. An xample of conditioning is modification of the 
phosphodiester backbone of the nuci ic acid molecule (e.g.. cation 
xchange), which can be useful for eliminating peak broadening due to a 
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heterogeneity in the cations bound per nucleotide unit. Contacting a 
nucleic acid molecule with an alkylating agent such as akyliodide, 
iodoacetamide, )?-iodoethanol, or 2, 3-epoxy-1 -propanoic the monothio 
phosphodiester bonds of a nucleic acid molecule can be transformed into 
5 a phosphotriester bond. Likewise, phosphodiester bonds may be 
transformed to uncharged derivatives employing trtalkylsilyl chlorides. 
Further conditioning involves incorporating nucleotides that reduce 
sensitivity for depurination (fragmentation during MS) e.a,. a purine 
analog such as N7- or N9-deazapurine nucleotides, or RNA building 
10 blocks or using oligonucleotide triesters or incorporating 

phosphorothioate functions which are alkylated or employing 
oligonucleotide mimetics such as PIMA. 
Multiplex reactions 

For certain applications, it may be useful to simultaneously detect 
1 5 more than one (mutated) loci on a particular captured nucleic acid 

fragment (on one spot of an array) or it may be useful to perform parallel 
processing by using oligonucleotide or oligonucleotide mimetic arrays on 
various solid supports. "Multiplexing" can be achieved by several 
different methodologies. For example, several mutations can be 
20 simultaneously detected on one target sequence by employing 

corresponding detector (probe) molecules (e.a„ oligonucleotides or 
oligonucleotide mimetics). The molecular weight differences between the 
detector oligonucleotides D1, D2 and D3 must be large enough so that 
simultaneous detection (multiplexing) is possible. This can be achieved 
25 either by the sequence itself (composition or length) or by the 

introduction of mass-modifying functionalities M1-M3 into the detector 
oligonucleotide (seel FIGURE 2). 
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Mass modification of nucleic acids 

Mass modifying moieties can be attached, for instance, to either 
the S'-end of the oligonucleotide (M^), to the nucleobase (or bases) (M^ 
M\ to the phosphate backbone (M'), and to the 2'-position of the 
5 nucleoside (nucleosides) (M*, M«) and/or to the temiinal 3'.posltion (M«) 
Examples of mass modifying moieties include, for example, a halogen, an 
azido, or of the type, XR, wherein X is a linking group and R is a 
mass-modifying functionality. The mass-modifying functionality can thus 
be used to introduce defined mass increments into the oligonucleotide 
10 molecule. 

The mass-modifying functionality can be located at different 
positions within the nucleotide moiety (see, U.S. Patent No. 

5,547,835 and International PCT application No. WO 94/21822). For 
example, the mass-modifying moiety, M, can be attached either to the 
15 nucleobase, (in case of the c' -deazanucleosides also to C-7, M') to 
-tfie triphosphate group at the alpha phosphate, M^ or to the 2'-position 
Of the sugar ring of the nucleoside triphosphate, and M«. 
Modifications introduced at the phosphodiester bond (M4), such as with 
alpha-thio nucleoside triphosphates, have the advantage that these 
20 modifications do not interfere with accurate Watson-Crick base-pairing 
and additionally allow for the one-step post-synthetic site-specific 
modification of the complete nucleic acid molecule ^ via alkylation 
reactions (see, Nakamaye it aL (1 988)Nucl. Arirf. p^. 16:9947- 
59). Particularty preferred mass-modifying functionalities are boron- 
25 modified nucleic acids since they are better Incorporated into nucleic 
acids by polymerases (see, Porter et bL (1 995) Biosbemistry 
M:11963-11969; Hasan _EaL (1996) rNkideic.Aads^ 24:21 50- 
2157; U et aL (1995) Nucl. Acids R^^ o^-^/iqc ^gp^j 
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Furthermore. the mass-modifying functionaltty can be added so as 
to affect chain termination, such as by attaching It to the 3'-position of 
the sugar ring in the nucleoside triphosphate, M^ For those skilled in the 
art, it is clear that many combinations can be used in the methods 
5 provided herein. In the same way. those skilled In the art will recognize 
that chain-elongating nucleoside triphosphates can also be mass-modified 
in a similar fashion with numerous variations and combinations in 
functionality and attachment positions. 

Without being bound to any particular theory, the mass- 
10 modification. M. can be introduced for X in XR as well as using 
oligo-Zpolyethylene glycol derivatives for R. The mass-modifying 
increment in this case is 44, i.e. five different mass-modified species can 
be generated by just changing m from 0 to 4 thus adding mass units of 
45 (m=0). 89 (m = l). 133 (m = 2), 177 (m = 3) and 221 (m = 4) to the 
1 5 nucleic acid molecule (e^ detector oligonucleotide (O) or the nucleoside 
triphosphates (FIGURE 6(C)), respectively). The oligo/polyethylene 
glycols can also be monoalkylated by a lower alkyi such as methyl, ethyl, 
propyl, isopropyl. t-butyl and the like. A selection of linking 
. functionalities. X. are also illustrated. Other chemistries can be used in 
20 the mass-modified compounds (see. e^ those described in 

nn^onnH^ntides r-^ A»»inn..>.s;. A Practical Approach, F. Eckstein, 

editor, IRL Press, Oxford, 1991). 

In yet another embodiment, various mass-modifying functionalities. 
R, other than oligo/polyethylene glycols, can be selected and attached via 
25 appropriate linking chemistries. X. A simple mass-modification can be 
achieved by substituting H for halogens like F. CI. Br and/or I, or 
pseudohalogens such as CN, SCN. NCS, or by using different alkyl. aryl 
or aralkyi moieties such as methyl, ethyl, propyl, isopropyl, t-butyl. hexyl. 
phenyl, substituted phenyl, benzyl, or functional groups such as CH^F. 
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^ , «(CH 1 (CHJ. S«CH,)(C^.)2. Si( W.- Yet 

3no*er ^.ecute te^ de.ec«.r (D). or 

ne.er.pep«d« >„ ,enera*« n^ss- 

„„c,eosi<.e -HPHOS^'- J J„. ,^ „, 
e ^-"-^ =--;'-^X,„..«o„s o, 74 .r-t. .=0,. 13, .r-l, 
'88(r=t.m=2).24Sl _^^,,^,„„3-„, 74 (r=,. 

""O-""- - ^ ---^^ „e,r.4. .=0,. etc. are 

r retec an possWe variarUs/muUnU simuttaneously 
simultaneously detect P ^^3tlona at the site of a 

rre:^:v ir;- - -^.0.. a -etector 
,5 mut«.oo c«, be detect J ^ ^^^^^ 

.„.,aonucleo«de. so that „„d,„side 

^ varying -^07^^, "::^, mass modir.cations also can be 
triphosphates (FIGURE 6C). 

incorporated dudn. .be '-^TS^^''^^'^ 'o"-- '" ^ 
«GURE 3 sbowa ' J „,„„,speoi«c capture 

aiWeren^adon is ^^^'J^^J, ^ «« 

r "Ct^r ■; drr-e. Uces t, 



Ml-Mn. 
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Mass spectrometric methods for sequencing DNA 
Amenable mass spectrometric formats for use herein include the 
ionization (0 techniques, such as matrix assisted laser desorption 
ionization (MALDl). electrospray (ESI) continuous or pulsed); and 
5 related methods (e^ lonspray.Thermospray. Fast Atomic 

Bombardment), and massive cluster impact (MCI); these ion sources can 
be matched with detection fom^ats including lin-linear folds) fme-of- 
flight (TOR, single or multiple quadruple, single or multiple magnetic 
sector. Fourier transform ion cyclotron resonance (FTICR), ion trap, or 
10 combinations of these to give a hybrid detector (^ ion trap - t.me of 
flight). For ionization, numerous matrix/wavelength comb.nat.ons 
including frozen ana.yte preparation (MALDI) or solvent combinafons 

(ESI) can be employed. ...at 
Since a normal DNA molecule includes .our nudeoOde unrts (A. T, 
15 c G), and the mass of each of *ese U unique (monolsotoplc masses 
sis 06. 304.05. 289.05, 329.05 Da, respectively), an accurate mass 
. detem^nation can def,ne or cons«.n the possible --P"— ' 
«,a. DNA. only above 4900 Da does each unit molecular we-ght have a. 
least one allowabte composition; among al, B-mers there is only one non- 
20 unique ncn^> molecular weight, among 8-mers. 20. for *ese a^ 
iargar oiigonucleotidea. such mass overtaps can be rasolved wd. 

inO" (-10 part pe, million, ppm) mass accuracy available wrth h.gh 
resolution FTICB MS. For the 25Hner A.T„, the 20 composi^on 
degeneracies «rt»n measured at ±0.6 Da is reduced u> three (A.T». 
25 T.C„G„AT,C.G,J when measured with 2 ppm accuracy Given 

c;:;oLtion constraints d-a presence or absence o, one of «,e four 
bases in the strand) can reduce this further (sea below). 



wo 98001^ 



PCT/US97/20444 



-67- 

Medium resolution instrumentation, including but not exclusively 
curved field reflectron or delayed extraction time-of-flight MS 
instruments, can also result in improved DNA detection for sequencing or 
diagnostics. Either of these are capable of detecting a 9 Da (Am (A-T)) 
5 shift In ^30-mer strands generated from, for example primer oligo base 
extension (PROBE), or competitive oligonucleotide single base extension 
(COSBE), sequencing, or direct detection of small amplified products. 
BiomassScan 

In this embodiment, exemplified in Example 33, two single 
10 stranded nucleic acids are individually immobilized to solid supports. One 
support contains a nucleic acid encoding the wild type sequence whereas 
the other support contains a nucleic acid encoding a mutant target 
sequence. Total human genomic DNA is digested with one or more 
restriction endonuclease enzyme resulting in the production of small 
15 fragments of double stranded genomic DNA (10-1,000 bp). The digested 
DNA is incubated with the immobilized single stranded nucleic acids and 
the sample is heated to denature the DNA duplex. The immobilized 
nucleic acid competes with the other genomic DNA strand for the 
complementary DNA strand and under the appropriate conditions, a 
20 portion of the complementary DNA strand hybridizes to the immobilized 
nucleic acid resulting in a strand displacement. By using high stringency 
washing conditions, the two nucleic acids will remain as a DNA duplex 
nly if there is exact identity between the immobilized nucleic acid and 
the genomic DNA strand. The DNA that remains hybridized to the 
25 immobilized nucleic acid is analyzed by mass spectrometry and detection 
of a signal in the mass spectrum of the appropriate mass is diagnostic for 
the wild typ or mutant allel . In this manner, total genomic DNA can be 
isolated f r m a biological sample and screened for the pres nee or 
absence of certain mutations. By immobilizing a variety of single 
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.sanded nuCeic acids in an array forn«,. a pane, af n,u.ation. „av be 
«m„,.a„eous.y screened for a number ., «e„e«„ k,ci ^ "^^o, 
.n addi^on. ^i„, ,ese s«n«ent wasHin, condi^tT^eT^::^ 
ONA s.a„d may be anaiyzed by n«ss specc^etry ,or Chan JT 
mass resu,«n9 fron, a de.e«on or insertion wi«,in *e targeted res^^„ 
endonuolease fragment. a"«> restnction 

PHmer oligonucleotide base extension 

As described in detail in the following Examole IJ • 

10 r - - : r™- 

den^hes the exact number of repeat units fl^ „ J 

■o homogenous stretches, as well as second site muta«„„. 
polymorphic region, which are otherwise only detectabte h„ 
^us. the PaOBa technique Increases «,e toll n::r r^Zr " 
a. eles at a distinct genomic site, leading ,o a higher polymorphism 
.nfom,at.on content IPIC, and yielding a far more defmible lde^:L«„„ 
-. for .nstance statistics-based analyses in paternity or forensics 
applications. 



IS 



_ The metf,od is based on the extension of a detection primer ti,at 
anneals adiacen. to a variable nucleotide tandem repeat iV^Z T 
20 polymorphic mononucleotide sti-etch using a ONA polymerase,!. 1 

ZZTZ r'^: ""^"^'^^ ^ - not 

pr^n. ,„ the deoxy fom,. The resuWng pr«,uc.s are eva,uated and 

^s^ved by MALOI-TOP mass spectrometry w«hout further labeling of 

25 it T; ' 28 unrelated 

"Hn t' — • 

was ,n «,e worst case 0.38% (Se-mer). which is comparable to 

.^.roxlmately o., base accuracy; routine standard mass deviations are in 
*a r«.g „, 0.1 % ,.03 bases,. Such accuracy witi. conventional 
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• thnds is not realistic, underscoring the value of PROBE 
.»l«»ctroDhoretic methods is noi \v» 

electropnore ,„,^„sic medicine and paternity testing. 

™.es possible *e si,nu«anecus n«asuremen. o. a« ~s o, a 
re«. ..on, n»ss —es instead ., base cou„.u^ Ten 4 

A.di«ena»v. *e mass di«e,e„ces be«,een adiacen. bases 
generated .om uni<a.e«, de,«da«on in a «epwise ""^^^ 
,0 Lnuciease can be used to read *e endre sequence o. frafln«n« 
.H Whereas UV or fluorescent n«asu«n«ntsw,llnot 
generwed. Vfltereas UV generated 

discrtn*«te ,ni«ures o. ^"^^ ^„ ,3 ^ i* 

«,en «.e e»nuc.e,se -^^^^^ ,„ „„«ng between 
™ss specon^trv ^J^^'^J , signi«cant. Tbe 

,B n^iecuiar mass o. d^ dT^d ^ ^^^^^ 

mass of the adjacent bases lue^. nuo. 

using fast Atorr-c Bombardment ,.A« or Eiectrcnsprav 

ioniza^cn ,ES>, n««. spe«rome«v ^^^^^ _ ^ 

New mutation screening uv^ . - 

""^ '''partia. sequence in.om«tion obtained from tandem mass 

. can place composition constraints as descnbedrn the 

spacbometrv <"S^ ""P ' „, fragment 

::n:r:c— t-ted dissoca^on .CAO, .bicb di«er . 
3 Oa discounts T.C„G„ .*.cb contains no A nuCectides; con.rm.g 
™,e then a singie A eUmioates AT,C.G., as a possible ccmpos««n. 



wo 98^0166 



PCT/CJS97/20444 



-70- 



0 



MS" can also be used to determined full or partial sequences of 
larger DNAs; this can be used to detect, locate, and identify new 
mutations in a given gene region. Enzymatic digest products whose 
masse^s are correct need not be further analyzed; those with mass shifts 
could be isolated in real time from the complex mixture in the mass 
spectrometer and partially sequenced to locate the new mutation. 

Table I describes the mutation/polymorphism detection tests that 
have been developed. 

Table I 

Mutation/Polymorphism Detection Tests 



Clinical Associa tion 
Cystic Fibrosis 

Heart Disease (Cholesterol 
Metabolism} 



► i Thyroid Cancer 



Sickle Cell Anemia/ 
Thalassemia 

HIV Susceptibility 

Br est Cancer 
Susceptibility 

Thrombosis 



Gene 
CFTR 

Apo E 
Apo A-IV 
Apo B-100 



Mutation/Polymorphism 

38 disease causing mutations 
«n 14exons/introns 

112R, 112a 158R, 158C 
347S, 347T. 360H, 360Q 
3500Q, 3500R 



Arteriosclerosis 
Hypertension 



RET proto- 
oncogene 

beta-globin 

CKR-5 
BRCA-2 

Factor V 

Gplila 
E-sele ctin 

ACE 



C634W, C634T, C634R 
C634S, C634F 

Sickle cell anemia S and C 
45 thalassemia a lleles 

32bp deletion 
2bp (AG) deletion in exon 2 



R506Q 

L33P 
S128R 



l/D polymorphism 



Det ction of mutations 

Diagn sis f genetic diseases 

The mass spectrometric proc sses described above can be used 
for example, to diagnose any of the more than 3000 genetic diseases ' 
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currenay known fe^ henK,phi«as, *alassen,,,s. Ouchenne Muscular 
Dystrophv (DMO,. HuntTngton-a Disease (HO,. Afehe Ws DIseaselrl 
Cystic Rbrosis (CF), or to be identified. * 

™" "^'""""S 3 P^'^'e. a mass spectrometric method 

S ^r detecfn, a mu.a«on (^F60« „, the cys«c fibrosis transmelle 
co«.ucu>nce regulator gene (CFTK,. which differs by only threetT 
pairs ,300 da.o„s, from the wild type o, CFTR gene As dTs^l^r 
further ,„ Example 3. me detection is based on a sinole** 
o..-gonucleo.de single base e«ens.n <COSs;„"::: trg a^p^r 
10 pnmers with «,e S'-terminal base complementary to either 1\ 

mutant allele. Upon hybndi^aon and addition o" a ^.^1:7;°' 
nucleoside tnphosphate one base downstream, only thoTeTnlrs 
property annealed (...e. no 3'-tem,lnal mismatch) are extenL w 

^deson.t,on„n.zat,ontime^,-fligb. mass spectrometry. F„r .he 

mutant (M) pnmers generate 29- and Sl-mers for N and M 
homozygotes. respec«vo.y. and both for he.ero.ygo.es. Since primer 

20 Tss T" ""T" """"'^ '"""^ '° on'^e 

massdifferencebetweentheseareatfeastthatofaslngle -300 Da 
nucleotide unit, low resoluhon instrumentadcn is suitable for such 

measurements. 

Tl.em»sequence cycle sequencing, as further described in 
Example 1 1. is also useful for detectins a genedc disease 

c r,«- l"^"^" *° ^ result in genetic disease 

c rta.„ b«h defects are the result of chromosomal abnormalities su^h 
Tnsomy 2, (Down's Syndr me). Trisomy 13 (Patau Syndr Tr 

s X chromosome aneuploidies such as KlienfeHer-s Syndrome (XXY,. 
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Here, "house-keeping'' genes encoded by the chromosome in question 
are present in different quantity and the different amount of an amplified 
fragment compared to the amount in a normal chromosomal 
configuration can be determined by mass spectrometry. 
5 Further, there is growing evidence that certain DNA sequences 

may predispose an individual to any of a number of diseases such as 
diabetes, arteriosclerosis, obesity, various autoimmune diseases and 
cancer (e.g.. colorectal, breast, ovarian, lung). Also, the detection of 
*DNA fingerprints*, e.g. . polymorphisms, such as "mini- and micro- 

10 satellite sequences*, are useful for determining identity or heredity ( e.g., 
paternhy or maternity). 

The following Examples 4 and 1 2 provide mass spectrometer 
based methods for identifying any of the three different isoforms of 
human apolipoprotein E, which are coded by the E2, E3 and E4 alleles. 

15 For example, the molecular weights of DNA fragments obtained after 
restriction with appropriate restriction endonucleases can be used to 
detect the presence of a mutation and/or a specific allele. 

Depending on the biological sample, the diagnosis for a genetic 
xiisease, chromosomal aneuploidy or genetic predisposition can be 

20 preformed either pre- or post-natally. 
Diagnosis of cancer 

Preferred mass spectrometer-based methods for providing an early 
indication of the existence of a tumor or a cancer are provide herein. For 
example, as described in Example 1 3, the telomeric repeat amplification 
25 protocol (TRAP) in conjunction with telomerase specific extension of a 
substrate primer and a subsequent amplification of the telomerase 
specific extension products by an amplification step using a second 
prim r complementary to the repeat structure was used to obtain 
extension ladders, that were easily d tected by MALDI-TOF mass 
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spectrometry as an indication of telomerase activity and therefor 
tumorigenesis. 

Alternatively, as described in Example 14. expression of a tumor or 
cancer associated gene (e.g. . human tyrosine 5-hydroxylase) via RT-PCR 
5 and analysis of the amplified products by mass spectrometry can be used 
to detect the tumor or cancer (e.g.. biosynthesis of catecholamine via 
tyrosine 5-hydroxylase is a characteristic of neuroblastoma). 

Further, a primer oligo base extension reaction and detection of 
products by mass spectrometry provides a rapid means for detecting the 
10 presence of oncogenes, such as the RET proto oncogene codon 634, 
which is related to causing multiple endocrine neoplasia, type II (MEN II), 
as described in Example 1 5. 
Diagnosis of infection 

Viruses, bacteria, fungi and other infectious organisms contain 
1 5 distinct nucleic acid sequences, which are different from the sequences 
oontained in the host cell. Detecting or quantitating nucleic acid 
sequences that are specific to the infectious organism is important for 
diagnosing or monitoring infection. Examples of disease causing viruses 
that infect humans and animals and which may be detected by the 
20 disclosed processes include: Retroviridae (g^. human immunodeficiency 
viruses, such as HIV-1 (also referred to as HTLV-III, LAV or HTLV- 
lll/LAV, see, e.g.. Ratner ej aK (1985) Nature 313 : 227-284; Wain- 
Hobson et ah (1985) Cell 40:9-17); HIV-2 (see, Guyader et aL (1987) 
Nature 323:662-669 European Patent Publication No. O 269 520; 
25 Chakrabarti fij aL d 987) Nature 328:543-547; and European Patent 

Application No. O 655 501); and other isolates, such as HIV-LP 
(Int mat! nal PCT application No. WO 94/00562 ntided "A Novel 
Human Immunodeficiency Vims': Picomaviridae (e.g.. polio viruses, 
hepatitis A virus, (see, c^. Gust ej sL (1983) Intervirofogy 20:1-7); 
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entero viruses, human coxsackie viruses, rhinoviruses, echoviruses); 
CalcivMdae te^ strains that cause gastroenteritis); Togaviridae (e^ 
equine encephalitis viruses, rubella viruses); Flaviridae (e^ dengue 
viruses, encephalitis viruses, yellow fever viruses); CoronavMdae (e^ 
5 coronaviruses); Rhabdoviridae (e^ vesicular stomatitis viruses, rabies 
viruses); Filoviridae (e^ ©bola viruses); Paramyxoviridae <e^ 
parainfluenza viruses, mumps virus, measles virus, respiratory syncytial 
: virus); Orthomyxoviridae (e^ influenza viruses); BungavMdae (e^ 
Hantaan viruses, bunga viruses, phleboviruses and Nairo viruses); Arena 
10 viridae (hemorrhagic fever viruses); Reoviridae (g^ reoviruses, 

orbiviruses and rotaviruses); Bimaviridae; HepadnavMdae (Hepatitis B 
virus); Parvoviridae (parvoviruses); Papovaviridae (papilloma viruses, 
polyoma viruses); Adenoviridae (most adenoviruses); HerpesvMdae 
(herpes simplex virus (HSV) 1 and 2, varicella zoster virus, 
15 cytomegaovirus (CMV), herpes viruses'); Poxviridae (variola viruses. 

vaccinia viruses, pox viruses); and Iridoviridae (e^, African swine fever 
virus); and unclassified viruses (e^, the etiological agents of Spongiform 
' encephalopathies, the agent of delta hepatitis (thought to be a defective 
satellite of hepatitis B virus), the agents of non-A. non-B hepatitis (class 
20 1 = Internally transmitted; class 2 = parenterally transmitted (i.e.. 
Hepatitis C); Norwalk and related viruses, and astro viruses). 

Examples of infectious bacteria include, but are not limited to: 
Helicobacter pyloris. Borelia burgdorferi. Legiortella pneumophilia. 
Mycobacteria sps (g^ M. tuberculosis. M. avium. M. intracellulare. M. 
25 kansau. M. gordohae). Staphylococcus aureus. Neisseria gonorrhoeae. 
Neisseria meningitidis. Listeria monocytogenes. Streptococcus pyogenes 
(Group A Streptococcus). Streptococcus agalactiae (Group 8 
Str pt coccus). Streptococcus (viridans group). Streptococcus faecalis. 
Streptococcus bovis. Streptococcus (anaerobic sps.). Streptococcus 
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phhenumoniae. pathogenic Campylobacter sp., Enterococcus sp., 
Haemophilus influenzae. Bacillus antracis, corynebacterium diphtheriae. 
corynebacterlum sp., Erysipelothrix rhusiopathiae. Clostridium 
perfringers, Clostridium tetani, Enterobacter aerogenes, Klebsiella 
pneumoniae, Pasturella multocida, Bacteroides sp., Fusobacterium 
nucleatum, Streptobacillus moniliformis. Treponema pallidium. 
Treponema pertenue, Leptospira, and Actinomyces israelii. 

Examples of infectious fungi include: Cryptococcus neoformans, 
Histoplasma capsulatum, Coccidioides immitis, Blastomyces dermatitidis. 
Chlamydia trachomatis, Candida albicans. Other infectious organisms 
(i.e., protists) include: Plasmodium falciparum and Toxoplasma gondii. 

The processes provided herein makes use of the known sequence 
information of the target sequence and known mutation sites. Although 
new mutations can also be detected. For example, as shown in FIGURE 
8, transcription of a nucleic acid molecule obtained from a biological 
ample can be specifically digested using one or more nucleases and the 
fragments captured on a solid support carrying the corresponding 
complementary nucleic acid sequences. Detection of hybridization and 
the molecular weights of the captured target sequences provide 
20 information on whether and where in a gene a mutation is present. 
Alternatively, DNA can be cleaved by one or more specific 

ndonucleases to form a mixture of fragments. Comparison of the 
molecular weights between wildtype and mutant fragment mixtures 
r suits in mutation detection. 

r' 

Sequencing by generation off spectficaHy terminated fragements 

In another embodiment, an accurate sequence determination of a 
r latively large targ t nucleic acid, can be btained by gen rating 
specifically t rminated fragments from the target nucleic acid, 
d termtning th mass of each fragment by mass spectrometry and 



15 



25 
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ordering fragment .o determine the sequence ot the larger targe, 
nuc.e.0 acid. .„ , preferred en,hodi.ent, the speclficalV termlnatr 
frag-nents ^ par«a, or complete base-.peciflca,lv terminated fragments 
. , One method for generating base specifically ,em,lnated fragments" 

T ' "'^'^'^ "■•""""'^'^ ^ ■> —'In 
re^on. -deferred base-speclfic rlhonuCeases are selected from among- 
Vnbonudease (G-speciflc). U^-ribonuclease (A-spedflc.. PhyM- 
nbonuclease U specific and ribonuclease A (U/C specific) Other • 
and base-specl^c Hbonucleases can be iden««ed Ig 1 
described In Example ,6. Preferably modified nucleotides are included in 

«.e modmed nucleotides and unmodified nucleotides are added to «,e 
transcnption reaction a. appropriate concentrations, so «,a. both moieties 
a^e incorporated at a p^ferentia. rate of about AltematK,e.y. ZT 

separate transcrtptions Of «,e target DNA sequence one With J 
n>od,fied and one with the unmodified nucleotides can be perfomied and 
.*e results compared. Prefened modified nucleotides indole: b^n T 
^om„e modified nucleotides (Porter ^ ^ ,,995, g,,H«^ 

nud J ''''''''^'■^^^^••^'^^"''•^ 

nucleobdes. as well as mass-modlfied nucleotides as described above 

'^<'*«"™*<'<"<'r generating base specifically tem,lnated 
fragments involves perfom,ing a combined amplification and base- 

25 ^--P-a-a combined amplification and 

tennrnatron reaction can be performed using at least two different 

polymerase enzymes, each having a different affinity for the chain 

tem^nabng nucleotide, so that polymerization by an enzyme with 

relatrvaly tew affinity for the chain .e,„,lnating nuci o«de leads .0 

xponentia. amplification wh reas an enzyme with r ,a«v ,y high affinity 
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for the chain terminating nucleotide terminates the polymerization and 
yields sequencing products. 

The combined amplification and sequencing can be based on any 
amplification procedure that employs an enzyme with polynucleotide 
5 synthetic ability (e.g., polymerase). One preferred process, based on the 
polymerase chain reaction (PGR), includes the following three thermal 
steps: 1) denaturing a double stranded (ds) DNA molecule at an 
appropriate temperature and for an appropriate period <ST time to obtain 
the two single stranded (ss) DNA molecules (the template: sense and 
10 antisense strand); 2) contacting the template with at least one primer 
that hybridizes to at least one ss DNA template at an appropriate 
temperature and for an appropriate period of time to obtain a primer 
containing ss DNA template; 3) contacting the primer containing template 
at an appropriate temperature and for an appropriate period of time with: 
15 (i) a complete set of chain elongating nucleotides, (ii) at least one chain 
terminating nucleotide, (iii) a first DNA polymerase, which has a relatively 
low affinity towards the chain terminating nucleotide; and (iv) a second 
DNA polymerase, which has a relatively high affinity towards the chain 
terminating nucleotide. 
20 Steps 1)-3) can be sequentially performed for an appropriate 

number of times (cycles) to obtain the desired amount of amplified 
sequencing ladders. The quantity of the base specifically terminated 
fragment desired dictates how many cycles are performed. Although an 
increased number of cycles results in an increased level of amplification, 
25 It may also detract from the sensitivity of a subsequent detection. It is 
therefore generally undesirable to perform more than about 50 cycles, 
and is mor preferable to perform less than about 40 cycles ( e.g.. about 
20-30 cycles). 
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Anotf,er preferred process for simutaneously amplifying and chain 
~, a nuoieic acid sequence is based on strand dispiace J„ 

SuandD,splacemen. Amplification Using Thern,opl,i,ic Enzymes-) ,„ 
essence, this process involves the following three steps, which 

consti««e a cyCe: „ denatuHng a d«.ble stranded ,ds, DNA 
molecule containing the sequence to be «np|ified a, an appropriate 
temperature and for an appropriate period of time u. obtain the two 

strand). 2) contacting the template witi, at leas, one primer (P) «,a, 

contams a recognition/cleavage site for a restiiction endonucloase (RE, 

3nd that hybHdi.es to at least one ss DNA template a. an apt pi' e 

temperature and for an appropriate period of time to obtain a primer 

co„„.n,ng ss ONA template; 3) contacting the primer containi^gZplate 
= an app .,„,,,3,„,, ^ ^^^^^^^^ ^^^^ ^.e 

. .) a complete set of chain elongating nucleotides; ,ii) a, leas, one chair 

L7a"ff Z " '"^ ^'^ - - "Mvely 

20 pZ *° ""-^ «v. a second DNA 

polymerase. Which has a relatively high affinity towards the chain 
tem«nating nucleotide; and (v) an RE tf,at nicks the primer 

recognition/cleavage site. 

Steps 1 )-3) can be sequentially perfomied for an appropriate 
number of times (cycles, to obtain ti,e desired amount of amplified 
25 sequencing ladders. As with ti,e PCR based process. «,e quant^^o, ti,e 
base speciflcally .em,ina,ed fragment desired dictates how many cjc 
a™ perfom,ed. l^eferabiy. less than 60 cycles, more preferably less L 
about 40 cyces and most preferably about 20 to 30 cycles a„ 
performed. 



wo 9S/20166 



PCT/US97/20444 



-79- 

Preferably about 0.5 to about 3 units of polymerase is used in the 
combined amplification and chain termination reaction. Most preferably 
about 1 to 2 units is used. Particularly preferred polymerases for use in 
conjunction with PCR or other thermal amplification process are 
5 thermostable polymerases, such as Taq DNA polymerase (Boehringer 
Mannheim), AmpliTaq FS DNA polymerase (Perkin-Elmer), Deep Vent 
(exo-). Vent, Vent (exo-) and Deep Vent DNA polymerases (New England 
Biolabs), Thermo Sequenase (Amersham) or exo(-) Pseadococcus 
furiosus (Pfu) DNA polymerase (Stratagene, Heidelberg, Germany). 
10 AmpliTaq, Ultman, 9 degree Nm, Tth, Hot Tub, and Pyrococcus furiosus. 
In addition, preferably the polymerase does not have 5'-3' exonuclease 
activity. 

In addition to polymerases, which have a relatively high and a 
relatively low affinity to the chain terminating nucleotide, a third 
15 polymerase, which has proofreading capacity (e.g., Pyrococcus woesei 
(PwoJ) DNA polymerase may also be added to the amplification mixture 
to enhance the fidelity of amplification. 

Yet another method for generating base specifically terminated 
fragments involves contacting an appropriate amount of the target 
20 nucleic acid with a specific endonuclease or exonuclease. Preferably, the 
original 5' and/or 3' end of the nucleic acid is tagged to facilitate the 
ordering of fragments. Tagging of the 3' end is particulariy preferred 
when in vitro nucleic acid transcripts are being analyzed, so that the 
influence of 3' heterogeneity, premature termination and nonspecific 
25 elongation can be minimized. 5' and 3' tags can be natural ( e.q„ a 3' 
poly A tail or 5' or 3' heterogeneity) or artificial. Preferred 6' and/or 3' 
tags are selected from among the molecules described for mass- 
modification above. 
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The methods provided herein are further illustrated by the 
following examples, which should not be construed as limiting In any 

EXAMPLE 1 

S MALDI-TOF <.e,orp«„„ „, „lig„„„deo«<,,s dir,o«y on «„« «^p.^ 

' 9 CPG (Controlled Pore Glass) was functionalEed with 
3.(«iethoxysily,)-epoxypropan to form OH-groups on the poly„,er surface 
A s^ndard ollsonucleotide synthesis with ,3 n,, of the OH-^ron a 
DNA sv„«,esl„r ,MII„gen. Model 7500, employing ^-cyanoethyl- 
Phosphoamidites (K6stergaL( 19941 Nu,l^> ^ 

performed to synthesize a T i;n,««^ i- 

synmesize a 3 -T^-SOmer oligonucleotide sequence in wk.vk 

so nuc,eo«des are complementary to a -hypothetical ■ SoleTs:, ent 
ie J;,:!?'"*^'- "^-^ ~a in L«,a„o, 

Of *e DMT group CPS which contained about ,0 umo. SSmer/g CPG 

.s 100 ^ and contains about 1 nmol CPG bou«. SSmer as template an 

heated fo, ,0 m,n at 65»C and cooled to 37-C during 30- (annealing,. 

tZiT Poiymer-bound 
.e^|,te were removed by centrffugation and three subse,uen. 

wash-ng/centrifugaflon steps wHh 100 ul each of ice-cold 60 mM 

soludon (3.hydroxypicolinic acid/lOmM ammonium citrate in 

" "'T '"'^^ ^ectrometry. 
• ne results are presented in figures 1 0 and 1 1 . 
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EXAMPLE 2 

Eiectrospray (ES) desorption and differentiation of an 18-mer and 19-mer 
DNA fragments at a concentration of 50 pmole/ul in 

2-propanol/1 OmM ammoniumcarbonate (1/9, v/v) were analyzed 

simultaneously by an eiectrospray mass spectrometer. 

The successful desorption and differentiation of an 1 8-mer and 

19-mer by eiectrospray mass spectrometry is shown in FIGURE 12. 

EXAMPLE 3 - 

Detection of The Cystic Fibrosis Mutation AF508, by single step dideoxy 
extension and analysis by MALDI-TOF mass spectrometiy (Competitive 
Oligonucleotide Simple Base Extension = COSBE) 

The principle of the COSBE method is shown in FIGURE 13/N 
being the normal and M the mutation detection primer, respectively. 
MATERIALS AND METHODS 

PCR Amplification and Strand Immobilization. Amplification was 
carried out with exon 10 specific primers using standard PCR conditions 
(30 cycles: r@95*>C, 1'@55*>C, 2'@720C); the reverse primer was 5' 
labelled with biotin and column purified (Oligopurification Cartridge, 
Cruachem). After amplification the amplified products were purified by 
column separation (Qiagen Quickspin) and immobilized on streptavidin 
coated magnetic beads (Dynabeads, Oynal, Norway) according to their 
standard protocol; DNA was denatured using 0.1 M NaOH and washed 
with 0.1 M NaOH, IxB + W buffer and TE buffer to remove the non- 
biotinylated sense strand. 

COSBE Conditions. The beads containing ligated antisense strand 
were resuspended in 18//I of Reaction mix 1 (2>c/l 10X Taq buffer, 1 /jL 
(1 unit) Taq Polymerase, 2 //L of 2 mM dGTP, and 1 3 //L H^O) and 
incubated at 80**C for 5' before the addition of Reaction mix 2 (100 ng 

ach of COSBE primers). The temperature was reduced to 60**C and the 
mixtures incubat d for a 5' annealing/extension period; the beads were 
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.hen cashed in 25mM trie«,v„n,n,o™u„, ace«.e (TEAA, followed by 
bOmM ammonium citrate. 

/Wmer S6,<,e„ce^. All primers were syntheabed on a Peraeo«„ 
B„.ys.„« Exped,.e 8300 DNA Syn«.esl.er osln. co„yen«2 

02.4539). COSBE pnmers (each conttWng an intentional mismatch „ 
^ase hCore the 3..,em.„„s, were those „.ed in a previousT^s ' jr 
IFeme aaL (1992) Am J r-,.^, 51:251-262) with th 

two hases were removed trom thel-end oftLrrr ~ 

<sao T3-3- 

ExlO PGR (Reverse): 5'-GTG TGA AGG GTT CAT ATG C 

15 rr'' r^""" ''■^^'^ '■''^ ^^"^ AAA CAC CAC A 3' 

15 (28-mer) (SEQ ID No. 3) 

■ 3°,3a ^'.T" ^'^"^^^ ^^'^ '^^'^ ACC ATT- 

3 (30-mer) {SEQ ID No. 4) v. m i j 

«L 18 rr f "-"'"S- were resuspended in , 

«Ki. 0.7 M dibasic ammonium citrate in 1:1 H,0:CHiCN) and 
resuspended beads (Tang^^ ,.s.s) B^^LE^ja,,^^,^ 
_727-730, were m.xed on a sample tarae, and allowed to air dry. Up to 

souroe re8K,n of an unmodified Thermo B,oanalysis (formerly Rn„i„an, 
V.s»ns 2000 MAtOI-TOF operated in re«ectron mode with 5 and 20 v 
- *e target and conversion dynode. respecVely. r^r.,::t.^ 
mo. cular weights (H(calc,) were calculated ,™m atomic composers 
V ndor provided soft ^ , , centroi~ 
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extemal calibration; 1 .08 Da has been subtracted from these to correct 
for the charge carrying proton mass to yield the text M,(exp) values. 

Scheme. Upon annealing to the bound template, the N and M 
primers (8508.6 and 9148.0 Da, respectively) are presented with dGTP; 
5 only primers with proper Watson-Crick base paring at the variable (V) 
position are extended by the polymerase. Thus if V pairs with the 3'- 
terminal base of N, N is extended to a 8837.9 Da product (N + 1). 
Ukewise, if V is property matched to the M terminus, IVTis extended to a 
9477.3 Da M + 1 product. 
10 Results 

Figures 14-18 show the representative mass spectra of COSBE 
reaction products. Better results were obtained when amplified products 
were purified before the biotinylated anti-sense strand was bound. 

EXAMPLE 4 

15 Differentiation of Human Apolipoprotein E Isoforms by Mass 
Spectrometry 

Apolipoprotein E (Apo E), a protein component of lipoproteins, 
plays an essential role in lipid metabolism. For example, it is involved 
with cholesterol transport, metabolism of lipoprotein particles, 

20 immunoregulation and activation of a number of lipolytic enzymes. 

There are three common isoforms of human Apo E (coded by E2, 
E3 and E4 alleles). The most common is the E3 allele. The E2 allele has 
been shown to decrease the cholesterol level in plasma and therefore 
may have a protective effect against the development of atherosclerosis. 

25 The ONA encoding a portion of the E2 allele is set forth in SEQ ID No 

130. Finally, the E4 isoform has been con-elated with increased levels of 
cholest rol, conferring predisposition to ath rosclerosis. Therefore, the 
identity of the apo E allele of a particular individual is an important 
d t rminant of risk for the development of cardiovascular disease. 
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As shown in Figure 1 9, a sample of DNA encoding apolipoprotein 
E can be obtained from a subject, amplified {e^ via PCR); and the 
amplified product can be digested using an appropriate enzyme {e^ 
Cfol). The restriction digest obtained can then be analyzed by a variety 
5 of means. As shown in Figure 20. the three isotypes of apolipoprotein E 
(E2, E3 and E4 have different nucleic acid sequences and therefore also 
have distinguishable molecular weight values. 

As shown in Rgure 21A-C, different Apolipoprotein E genotypes 
exhibit different restriction patterns in a 3.5% MetPhor Agarose Gel or 
10 12% polyacrylamide gel. As shown in Figures 22 and 23, the various 

apolipoprotein E genotypes'can also be accurately and rapidly determined 
by mass spectrometry. 

EXAMPLE B 
Detection of hepatitis B virus in serum samples. 
15 MATERIALS AND METHODS 

Sample preparation 

Phenol/choloform extraction of viral DNA and the final ethanol 
precipitation was done according to standard protocols. 
First PCR 

20 Each reaction was performed with 5/A of the DNA preparation from 

serum. 15 pmol of each primer and 2 units Taq DNA polymerase (Perkin 
Elmer, Werterstadt, Germany) were used. The final concentration of 
each dNTP was 200pMtA. the final volume of the reaction was 50 fA. 
lOx PCR buffer (Peridn Elmer, Weiterstadt, Germany) contained 100 mM 

25 Tris-HCI. pH 8.3, 500 mM KCI, 1 5 mM MgClj, 0.01 % gelatine (w/v). 
Primer sequences: 
Prim r SEQUENCE SEQIONo. 

1 5'^cnTGGGGCATGGACATTGACCCGTATAA-3' 5 

2 5'<rrGACTACTAATTCCCTGGATGCTGGGTCT-3' 6 
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Nested PCR: 

Each reaction was perforrned either with 1 /4 of the first reaction 
or with a 1 :10 dilution of the first PCR as template, respectively. 100 
pmol of each primer, 2.5 u Pfuiexo-) DNA polymerase (Stratagene, 
5 Heidelberg, Germany), a final concentration of 200 pM of each dNTPs 
and 5 fA lOx Pfu buffer (200 mM Tris-HCI, pH 8.75, 100 mM KCI. 100 
mM {NH4),S0^, 1% Triton X-100, 1 mg/ml BSA, (Stratagene, Heidelberg, 
Germany) were used in a final volume 50 jA. The reactions were 
performed in a thermocycler (OmniGene, MWG-Biotech, Ebersberg, 
10 Germany) using the following program: 92 «C for 1 minute, 60 "C for 1 
minute and 72-C for 1 minute with 20 cycles. Sequence of 
oligodeoxynucleotides (purchased HPLC-purified from MWG-Biotech, 
Ebersberg, Germany): 

HBV13: 5'-TTGCCTGAGTGCAGTATGGT-3' (SEQ ID NO. 7) 

15 HBVISbio: Biotin-5'-AGCTCTATATCGGGAAGCCT-3' (SEQ ID NO. 8) 
Purfffcation of ampffffed products: 

For the recording of each spectrum, one PCR, 50 pi, (performed as 
described above) was used. Purification was done according to the 
following procedure: Ultrafiltration was done using Ultrafree-MC filtration 

20 units (Millipore, Eschbom, Germany) according to the protocol of the 
provider with centrifugation at 8000 rpm for 20 minutes. 25p\ (10//g///|) 
streptavidin Dynabeads (Dynal, Hamburg, Germany) were prepared 
according to the instructions of the manufacturer and resuspended in 
25/il of B/W buffer (10 mM Tris-HCI, pH 7.5, ImM EDTA, 2 M NaCI). 

25 This suspension was added to the PCR samples still in the filtration unit 
and the mixture was incubated with gentle shaking for 1 5 minutes at 
ambient t mperature. The suspension was transferr d in a 1.5 ml 
Eppend rf tube and the supernatant was removed with the aid of a 
Magnetic Particle Collector. MPC. (Dynal, Hamburg, Germany). The 
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beads were washed twice with 50^ of 0.7 M ammonium citrate 

'^^^ -h .me usi„g the 

MPC). Cleavage from the beads can be accomplished by usinc 

^°""--de-t90oc. The supernatant was dried in a speedvac for ah . 
B an hour and resuspended in 4 ^ of u.trapure water (Mi^iO Jp p J 

^S:^^ • ~" - - -.Oi- 

MALDI-TOF MS: 

holder, «,e„ .mmediatov mixed Witt, 0.5 M mauix solution (O 7 L 
hydroxvpioolinic acid 50% aoetonltrile, 70 n,M ammonium oit^ate. "tk- 
mixture was dried at amblen. temperature and introduced C 'e 
spectrometer. All spect. were taKen in positive ion mode uC a ""^ 
Rnnigan MAT Vision 2000 (Finnigan MAT Bremen , 
IB -are«ectron,5.eVionsouri..0.e;::s:::r:r^^^^^^^^^^^^ 

^00.„«, Each sample was measured „iti,di«eren. laser energies In 

nega«ve samples. .He ampli«ed product was detected neither with 
■less nor wti, higher laser energies. In «,e positive samples th . 

- - P^uc. was detected at dl.erent Places o, the san::;^^^^^^ 
With varying laser energies. 

RESULTS 

" "^^ system was used for ti,e detection o, HBV ONA in 
blood samples employing oligonucleotides complementary to fh» 
- -*r"-~.PHmer,:hegi„ningatmap%ositi:r,;;rpr^^^ 
beg.n.g at map position ti,e complementaiy strand, encoding 

*e HBV core antigen (HBVcAg,. DNA was isolated from patienti= serul 
according to standard protocols. A first PCR was performed 
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,.«™,susina a first set of primers. If HBV DNA 
DNA from these preparations using o 

^ DNA fragment of 269 bp was generated, 

was present in the sample a DNA irag „..„,,„. 

,n «,e second reaction, primers which were complementary to a 
„glon wHHin the PCR fragment generated in the first « 
e HBV related an^iifled products were present in *e.rstPCR a DNA 

fragment of 67 bp was generated (see Rg. 2BA) .n this nested PCR. The 

and aiso serves as a specifiCtv control for the e«em. ^^R JRoHs 
,,992, PCR: ainical Diagnostics and Research. Spnnger. Heidelberg,. A 
,0 further adv»«age is that the amount o, fragments ,ener«^m^*e 

second PCR is high enough to ensure an unprobiematic de.ec«on 
although pudfication losses can not be avoKled. 

T^e samples w«e purified using ultrafiltration to restreptaVKlin 
Dvnabeads. TOs purification was done because the shoner prinw 

,5 figments were immob..i.ed in higher yield on the ^^^^ » 

Tu ■ mobilization was done directly on the Ultrafiltration 
reasons. The immobilization woa 

'A <=..hstance losses due to unspecific absorption on the 
membrane to avoid substance loss 

- --.^rthiliTation the beads were washed with 
membrane. Following immobilization, th 

ammonium citrate to perfonn cation exchange (Pieles si SU <19 ' — - 
20 Ada^2, -.3,91-3,96,. The immobilized DNA was cleaved from the 
t^ngls* ammonia which allows cleavage of DNA from the 
beads in a very short time, but does not resuH in an introduction of 

sodium or other cations. 

The nested PCRs and the MALOl TOF analysis were performed 

. • « th*. results of serological analysis. Due to the unknown 
25 without knowing the results oi » ^ ^ , 

. * first PCR was used undiluted as template 
virus titer, each sample of tiie first r^^n wa 

and In a 1:10 dilution, r spectively, 

sample 1 was c Hected from a patient witi. chronic active HBV 
in, ction who was positive in Hbs- and Hbe-antigen tests but nega^ve in 
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do, b>o, analysis. Sample 2 was a serum sample from a pa«em wW, 
an ac^e HBV .„,ec«„„ and a massive viramia who was HBV posi.vl 
a do, bio. anaivsis. Sample 3 was a dena,ure<. serum sample Lr^e 
no serological analysis oould be performed by an inoreased lev* oT 
,ransamin,ses indioating liver disease was de,eo,ed. m au«,radio<,r«,h 

- »™'va.a (Figure 24,. *e firs. PCRo,«,is sample was neaa«ve 
Neverttwiess, ttere was some evidence of HBV infection Th; 

" Of in,er^ for M^VU^I-TO. analysis, because i. demir: J^rer " 
-ow-level amount of amplified producu can be de,ec,ed after «,e 

»BV^r:'^T'^- Who was cured Of 

HBV nfec^on. Samples 5 and 6 were colleced from parents wi* a 
clironic acfive HBV infection. » vwm a 

Figure 24 shows the resul,s of a PAGE analysis of «,e nes,ed PCR 
reacnon. A amplified produc, is clearly revealed in samples 1 2 3 B 
and 6 In sample 4 no amplified produc, was genera,ed. i. is tadeed HBV 
negative, according .o ti,e serological analysis. Negative and porve 

a7e 'TT ^ 

.are v.s,ble ,n lanes 2. S. 6 and ^ if non-di.u,ed .empla,e was used 
20 22 ^"""^ " x-d in a ,;,0 

^ '^•^ """"" "--'V -^--'"a if ti,e 
^P^,e was no, dilu.ed. ^e rasulu of PAGE analysis are in agreemen. 
w* the d«a ob,a.ned by serological analysis excep, for sample 3 as 
discussed above. 

2S from ."T '""^ " "^"^ °' ' P'odoc, 
from sample number , genera,ed and purified as described above T,.e 

s.gna a. 207S4 Da represent ti.e single sfranded amplified produc. 

<=a.cu.a.ed: 20735 Da. as ti» average mass o, boti, sfrands of Z 

amplified produc, cleav d from ti« beads,. The mass diff rence of 

calcula,ed and obWned mass is ,9 Da ,0.09%,. As shown in Fig 25A 
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sample number 1 generated a high amount of amplified product, resulting 
in an unambiguous detection. 

Fig, 25B shows a spectrum obtained from sample number 3. As 
depicted in Fig. 24, the amount of amplified product generated in this 
5 section is significantly lower than that from sample number 1 . 

Nevertheless, the amplified product is clearly revealed with a mass of 
20751 Da (calculated 20735). The mass difference is 16 Oa (0,08%). 
The spectrum depicted in Fig. 25C was obtained from sample number 4 
which is HBV negative (as is also shown in Fig 24). As expected no 

10 signals corresponding to the amplified product could be detected. All 
samples shown in Fig. 25 were analyzed with MALDI-TOF MS, whereby 
amplified product was detected in all HBV positive samples, but not in 
the HBV negative samples. These results were reproduced in several 
independent experiments. 

15 EXAMPLES 

Analysis of Ligase Chain Reaction Products Via MALDI-TOF Mass 
Spectrometry 

MATERIALS AND METHODS 

Oligodeoxynucleotides 

20 Except the biotinylated one and all other oligonucleotides were 

synthesized in a 0.2 pmol scale on a MilliGen 7500 DNA Synthesizer 
(Millipore, Bedford, MA, USA) using the i9-cyanoethylphosphoamidite 
method (Sinha, N.D. et al. (1984) Nucleic Acids Res 1 9;AKQQ->^t^77) 
The oligodeoxynucleotides were RP-HPLC-purified and deprotected 

25 according to standard protocols. The biotinylated oligodeoxynucleotide 
was purchased (HPLC-purified) from Biometra, Gottingen, Germany). 
Sequences and calculated masses of the oligonucleotides used: 

OCgodeoxy- SEQUENCE SEQ ID 

nudeoHkle . 

30 A 5'-p-TTGTGCCACGCGGTTGGGAATGTA (7521 Da) g 
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B S'-p-AGCAACGACTGTTTGCCCGCCAGTTG (7948 Da) ,o 

C 5'-blo-TACATTCCCAACCGeGTGGCACAAC(7960Da) i, 

O 5-^,-AACTGGCGGGCAAACAGTCGTTGCT(7708Da) ,2 

5 S-Phosphorylation of oligonucleotides A and D 

This was performed with polynucleotide kinase (Boehrinaer 
Mannheim, Germany) according to published procedures, the ' 
5'.phosphorylated oligonucleotides were used unpurified for LCR. 
Ugase chain reaction 

'O ^= '■<='' "-"^P^rfo^^" with fl^-DNAIigase and a ligase chain 

reac«,„ ,s.„U8ana. Heide,.e.«. Se™,anW containing .1 J n. 
pauescnp. KU phaaan«ds. One carrying *c wiMtype forn, of *e f d, 
tool gana and tte o*ar ona a mutant of this gana with a singra p„,n, 
mutation at bp 1 91 of the feel gana. 
'S The following LCR conditions ware used for each reaction: 1 00 pa 

template ONA ,0.74 ftnci) wfth SCO pg sonif ed salmon sperm DNA as 
earner 25 ng ,3.3 pm.1, „, each 5-.phosphoryla,ad oligonucleotide. 20 
ng (2.S pmoll of each noni,hospho,yla,ed oligonuclaotida. 4 U Pfu ONA 

20 template, .n ,h,s case oBgo C was also biotinylated. All reactions were 
performed in a ti«rmocycler JOmnlGane. MWG-Biotech, Ebersberg 
German,, witf. tt« following program: 4 minutes 92-C. 2 minutes 'eo-C 
and 26 cycles cf 20 seconds 92»C. 40 seconds 60«C. Except for HPLC 
«nalys,s the biotinylated ligation eductC was used. In a control 
25 xpenment ti>e biotinylated and non-biotinylated oligonucleotides 
reverted tt,e same gel electtophoretic results. The reactions were 
analyzed on 7.5% polyacrylamide gals. Ugation product 1 ,olig„ A and 
B) calculated mass: 15450 Da, ligation product 2 ,oligo C and D) 
calculated mass: 15387 Oa. 
30 SMART-HPLC 
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lon exchange HPLC (IE HPLC) was performed on the 
SMART-system (Pharmacia, Freiburg, Germany) using a Pharmacia Mono 
Q, PC 1,6/5 column. Eluents were buffer A (25 mM Tris-HCI, 1 mM 
EDTA and 0,3 M NaCI at pH 8,0) and buffer B (same as A, but 1 M 
5 NaCI). Starting with 100% A for 5 minutes at a flow rate of 50//l/min. a 
gradient was applied from O to 70% B in 30 minutes, then increased to 
100% B in 2 minutes and held at 100% B for 5 minutes. Two pooled 
LCR volumes (40//I) performed with either wildtype or mutant template 
were injected. 
10 Sample preparation for MALDI-TOF-MS 

Preparation of immobilized DNA: For the recording of each 
spectrum two LCRs (performed as described above) were pooled and 
diluted 1:1 with 2x B/W buffer (10 mM Tris-HCI, pH 7.5, 1 mM EDTA, 2 
M NaCI). To the samples 5 >t/l streptavidin DynaBeads (Dynal, Hamburg, 
15 Germany) were added, the mixture was allowed to bind with gentle 
shaking for 1 5 minutes at ambient temperature. The supernatant was 
removed using a Magnetic Particle Collector, MPC, (Dynal, Hamburg, 
Germany) and the beads were washed twice with 50 >t/l of 0.7 M 
ammonium citrate solution (pH 8.0) (the supernatant was removed each 
20 time using the MPC). The beads were resuspended In 1 7/1 of ultrapure 
water (MilliQ, Millipore, Bedford, Mabelow). 

Combination of ultrafiltration and streptavidin DynaBeads: For the 
r cording of spectrum two LCRs (performed as described above) were 
pooled, diluted 1:1 with 2x B/W buffer and concentrated with a 5000 
25 NMWL Ultrafree-MC filter unit (Millipore, Eschbom, Germany) according 
t the instructions of the manufacturer. After concentration the samples 
wer washed with 300/4 1x B/W buffer to streptavidin DynaBeads were 
added. The beads were washed once on the Ultrafr e-MC fiHxation unit 
with 300 //I of 1x B/W buffer and processed as described above. The 
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beads were resuspended in 30 to 50//f of 1x B/W buffer and transferred 
«n a 1 .5 ml Eppendorf tube. The supernatant was removed and the 
beads were washed twice with 50 //I of 0.7 M ammonium citrate (pH 
8.0). Rnally. the beads were washed once with 30 fA of acetone and 
5 resuspended in 1 pi of ultrapure water. The ligation mixture after 
Immobilization on the beads was used for MALDS-TOF-MS analysis as 
- described below. 

MALDl-TOF-MS 

A suspension of streptavidin-coated magnetic beads with the 
10 immobilized DNA was pipetted onto the sample holder, then immediately 
mixed With 0.5 ,A matrix solution (0.7 M 3-hydroxypicolinic acid in 50*^ 
acetonitrile, 70 mM ammonium citrate). This mixture was dried at 
ambient temperature and Introduced into the mass spectrometer. All 
spectra were taken in positive ion mode using a Rnnigan MAT Vision 
15 2000 (Finnigan MAT, Bremen, Germany), equipped with a reflectron (5 
keV .on source, 20 keV postacceleration) and a nitrogen laser (337 nm) 
For the analysis of Pfu DNA ligase 0.5 pi of the solution was mixed on 
the sample holder with 1 fA of matrix solution and prepared as described 
above. For the analysis of unpurified LCRs 1 pi of an LCR was mixed 
20 with 1 /A matrix solution. 
RESULTS 

The £. colilacX gene served as a simple model system to 
investigate the suitability of MALDl-TOF-MS as detection method for 
products generated In ligase chain reactions. This template system 
25 contains of an £. coli , eel wildtype gene in a pBluescript Kll phagemid 

and an E. co/i/ad gene carrying a single point mutation at bp 191 (C to T 
transition; SEQ ID No. 131) in the same phagemid. Four differem 
ol.gonucleotid s were used, which were ligated only if the £co///aci 
wildtype gen was present (Rgure 26). 
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LCR conditions were optimized using Pfu DNA ligase to oh. • 

reao^n. were analyzed by polyacryla^ide ge. eteotrophoresis (plo,, 
a^. HPcc on «.e SMART ava«„ ,R,„„a 27. an. L,. ^r^ ' 
S sho«« a PAGE „, a pos«ve LCR wNh w„d,ype .e„,p„^ .J,," 
. n^a.Va LCR w«h n,u«n. template „ anC 2, anC a nesafVe col' , 
Which con«.„. en^e. oHgonucleo«des and no .en,p,a« bu.sat!,„ 
SP^ ONA. The «e. elecophoresis CeaHy ahowa *a. ^.e^, 
Produc ,S0 hp, was p^daced only ,„ *e ™ac«on v*. 
'O ^.ate; whereas „eW,er «,e template carrylns the poln, „ 
*e control „acdon with salmon sperm DMA «eneratL aLpZ^" 
products, m Hgure 28, HPLC was used to analvze ^ 
-.-.VPe template per,om,ed under the sZ^LZl • T 

product was cearly revealed. «.ure 23 sHows the r^ults^ r:," . 
IS w..ch two pooled negative LCRs wfth mutant template Z^^Zl"" 
These chromatograms conflm, ^ data shown In Rgu„ 27 andlT 
results taken together cleady demonstn,te, tha, the system „ 
n^on^products In a .gnl«cant amount only I, the rZ^^: . 

" -orrercrziorrr:^^^^^^^^ 

These Include the ,our oligonucleotides ,A. B C and JT'T''- 
sooner iwiM, " synthetic ds 

(wrth the same sequence as the ligation product, the wlld,„ 
t mplate DNA. sonicated salmon sperm DNA and the P^ON^ 
25 ligation buffer. 'wse In 

In order to test which purification procedure should b. 
a LCH reaction can be analy. d by MAU.l-TOP.MS. aZ^orat 
unpunfied LCR ,Rgure 30A, and aliguots o, the enzyme stocr , 

30B, were ana^zed v..b MAU^I-TOP-ms. hTurcTo Jlr 
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appropriate s„^te preparation is absolutely necessary since all ,i„ . 
«.e .npurffie. LCR correspond .o signals o.«ai„e«. i„ L ^A ^pt; 
-a,ys.s o, lisase. The calculated „ass valuroTo J 1 

and the ligation product are 7S21 Da and 1 S450 D. ^° * 

= 30 Show that the en.y™ aoluln °r.~^^ 

wh.h do interfere .th expected signals o;«:t:n:^--': 
products and therefore makes an unambiguous sianal aT 

^-"eo-re. the spectra showed sZs oTIT"* 
Twee„20 being part of the enzyme storage buff^hicH « 
10 ™i.a..on beha.or of the a„alyte,maL. n-irarrr 

"»d. r:r:ntarce::r:::rd~ - 

invnobilized by Watson-Cnck .JJL^^o^Z^'^'^ 
- .ragmen, covelendy bound to the bLs 11^7:^^^ ''''' 

^^^^^T^Z^^IT 1 ""'"9 immobilized 

* L/iNA ensures that only the non-biotinylated stranH . •.. c 

non-immobi.i.ed dsDI^A is analyzed both rnrare r r^'"*^'- " 

Therefore, employing this system for LCR only the non-ligated 
Oligonucleotide A. with a calculated mass of 752, Da and ,h r ■ 

as resredtre^r ^^^^^^^^i: 

^^^^^ ^ 

Wenbfication of the LCR educts and products. """^"O"^ 
Figure 3,A shows a MALDI-TOF mass spectrum „h».- ^ 

-o pooled LCRs (perfonned as descHbed abo " p^Ti " l""" 

«v ) punhed on streptavidin 
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DynaBeads and desorbed directly from the beads showed that the 
purification method used was efficient {compared with Figure 30). A 
signal which represents the unligated oligo A and a signal which 
corresponds to the ligation product could be detected. The agreement 
5 between the calculated and the experimentally found mass values Is 
remarkable and allows an unambiguous peak assignment and accurate 
detection of the ligation product. In contrast, no ligation product but 
only oligo A could be detected in the spectrum obtained-from two pooled 
LCRs with mutated template (Figure SIB). The specificity and selectivity 
10 of the LCR conditions and the sensitivity of the MALDI-TOF detection Is 
further demonstrated when performing the ligation reaction In the 
absence of a specific template. Figure 32 shows a spectrum obtained 
from two pooled LCRs in which only salmon sperm DNA was used as a 
negative control, only oligo A could be detected, as expected. 
1 5 While tiie results shown in Figure 31 A can be correlated to lane 1 

pf the gel in Figure 27, the spectrum shown in Figure 31 B is equivalent 
to lane 2 in Figure 27. and finally also tiie spectrum in Figure 32 
corresponds to lane 3 In Figure 27. The results are in congruence with 
tiie HPLC analysis presented in Figures 28 and 29. While gel 
20 electrophoresis (Figure 27) and HPLC (Figures 28 and 29) reveal ehher an 
excess or almost equal amounts of ligation product over ligation educts, 
tiie analysis by MALDI-TOF mass specti-ometiv produces a smaller signal 
for the ligation product (Figure 31 A). 

The lower Intensity of tiie ligation product signal could be due to 
25 different desorption/ionization efficiencies between 24- and a 50-mer. 
Since tiie T„ value of a duplex witii 50 compared to 24 base pairs Is 
significantiy high r, mor 24-mer c uld be desorbed. A reduction In 
signal intensity can also result fr m a higher degree of fragmentation in 
cas of the longer oligonuci otides. 
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Regardless of the purification with streptavidin DynaBeads Figure 
32 reveals traces of Tween20 in the region around 2000 Da 
Substances with a viscous consistence, negatively influence the process 
Of crystallization and therefore can be detrimental to mass spectrometer 
analysis. Tween20 and also glycerol which are part of enzyme storage 
buffers therefore should be removed entirely prior to mass spectrometer 
analysis. For this reason an improved purification procedure which 
•ncludes an additional ultrafiltration step prior to treatment with 
DynaBeads was investigated. Indeed, this sample purification resulted in 
a significant improvement of MALDI-TOF mass spectrometric 
performance. 

Rgure 33 shows spectra obtained from two pooled positive (Fig 
33A) and negative (Fig. 33B) LCRs, respectively. The positive reaction 
was performed with a chemically synthesized, single strand 50mer as 
template with a sequence equivalent to the ligation product of oligo C 
and D. Oligo C was 5'-biotinylated. Therefore the template was not 
detected. As expected, only the ligation product of Oligo A and B 

Lgated oligo C and D. This newly generated DNA fragment is 
20 representedbythemasssignalof 15448DainRgure33A. Compared 
to Rgure 32A, this spectrum clearly shows that this method of sample 
preparation produces signals with Improved resolution and intensity. 

EXAMPLE 7 

Mutation detection by solid phase oliao base ovt»»^: 

as ^^^o,.ro.^.^^.zt',^Z";^,r^^:;,:^ 

Summary 

The so«d-phase oligo base extension method detects point 
mutations and small deletions as well as small insertions in amplified 
DNA. The mediod is based on tt,e extension of a detection primer that 
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anneals adjacent to a variable nucleotide position on an affinity-captured 
amplified template, using a DNA polymerase, a mixture of three dNTPs, 
and the missing one dideoxy nucleotide. The resulting products are 
evaluated and resolved by MALDI-TOF mass spectrometry without 
5 further labeling procedures. The aim of the following experiment was to 
determine mutant and wildtype alleles in a fast and reliable manner. 
Description oftfte experiment 

The method used a single detection primer followed by a 
oligonucleotide extension step to give products differing in length by 
10 some bases specific for mutant or wildtype alleles which can be easily 
resolved by MALDI-TOF mass spectrometry. The method is described by 
using as example the exon 10 of the CFTR-gene. Exon 10 of this gene 
bears the most common mutation in many ethnic groups (AF508) that 
leads in the homozygous state to the clinical phenotype of cystic fibrosis. 
1 5 MATERIALS AND METHODS 
. Genomic DNA 

Genomic DNA were obtained from healthy individuals, individuals 
homozygous or heterozygous for the AF508 mutation, and one individual 
heterozygous for the 1 5068 mutation. The wildtype and mutant alleles 
20 were confirmed by standard Sanger sequencing. 

PCR amplification of exon 10 of ttie CFTR gene 
The primers for PCR amplification were CFExlO-F (5- 
GCAAGTGAATCCTGAGCGTG-3' (SEQ ID No. 13) located in intron 9 and 
biotinylated) and CFExlO-R (5'-GTGTGAAGGGCGTG-3' SEQ ID No. 14) 
25 located in intron 10), Primers were used in a concentration of 8 pmol. 
Taq-polymerase including lOx buffer were purchased from 
Bpehringer-Mannheim and dTNPs were obtained from Pharmacia, Th 
total r acti n volume was 50 lA. Cycling conditions for PCR were initially 
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5 min. at 95«>c, followed by 1 min. at 94oc, 45 sec at 53oc and 30 
sec at 720C for 40 cycles with a final extension time of 5 min at 72oc 
Punncatlon of the ampimed products 

Amplific«ion products were purified by using Qiagen's PCR 
S Purifica«on kit (No. 28,061 according to manufacturer-s Instructions 
T^e eMicn o, the puHfied products from the co.un,n was done in SO ,. 
TE-buffer (lOmM Tris, 1 mM EDTA, pH 7,5). 

Amnny-capture and denaturation of the double stranded DNA 

10 one well of a streptavidin-coated microtiter plate (No. 1 645684 

Boehringer-Mannheim or No. 95029262 Ubsystems,. Subsequently 10 

.ncubation buffer (80 mM sodium phosphate, 400 mM NaCI O 4% 
Tween20, pH 7,5, and 30^ water were added. After incubation' for 1 
hour at room temperature the wells were washed three times with 200^1 
IB wash.ng buffer (40 mM Tris, 1 EDTA, 50 mM NaCI, 0. 1 Tween 
-iO, PH 8.8). To denature the double stranded DNA the wells were 
treated with 100;. of a 50 mM NaOH solution for 3 min and the we.ls 
washed three times with 200 //I washing buffer. 
OHgo base extension reaction 

20 TU 

The annealing of 2S pmol detection primer (CF508- 6' 
CTATATTCATCATAGGAAACACCA-3- ,SEQ ,0 No. ,5, was performed 

1 M .oT'"^ """^ '° '<a- n-M (NH,,,SO,. 2 

mM MgS02. ,% Triton X-,00. pH 8, a. SO-C for ,0 min. The wells 

25 y7t r*"* — <n 200 ^ 

TE buffer. The extension reaction was performed by using some 

oomponents of the DNA sequencing kit from USB (No 707701 and 

c'oT • ' T,7 ""^ « M 

conta.n,ng ' 21 water. 6p. Se,uenase*uffer. 3 M 10 mM DTT 
solution. 4,6 Ml. 0.5 mM o, «.,ee dNTPs. 4.5^. 2 mM the missing one 
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ddNTP, 5,5 lA glycerol enzyme dilution buffer. 0,25 /A Sequenase 2.0, 
and 0,25 pyrophosphatase. The reaction was pipetted on ice and then 
incubated for 15 min at room temperature and for 5 min at 37*»C. 
Hence, the wells were washed three times with 200 /A washing buffer 
5 and once with 60 //I of a 70 mM NH4-Citrate solution. 

Denaturatfon and precipitation oftite extended primer 
The extended primer was denatured in 50 /H 10%-OMSO 
(dimethylsufoxide) in water at 80«C for 10 min. For precipitation, 10 M 
NH4-Acetate (pH 6.5), 0,5 /A glycogen (10 mg/ml water, Sigma No. 
10 G1765), and 100 //I absolute ethanol were added to the supernatant and 
incubated for 1 hour at room temperature. After centrifugation at 
13.000 g for 10 min the pellet was washed in 70% ethanol and 
resuspended m ^ /A ys Mohm/cm HjO water. 

Sample preparation and analysis on MALDi-TOF mass 
1 5 spectrometry 

Sample preparation was performed by mixing 0,3 pi of each of 
matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M dibasic 
ammonium citrate in 1:1 H^OrCHaCN) and of resuspended DNA/glycogen 
pellet on a sample target and allowed to air dry. Up to 20 samples were 
20 spotted on a probe target disk for introduction into the source region of 
an unmodified Thermo Bioanalysis (formeriy Rnnigan) Visions 2000 
MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target 
and conversion dynode, respectively. Theoretical average molecular 
mass (M,(calc)) were calculated from atomic compositions; reported 
25 experimental Mr (M,(exp)) values are those of the singly-protonated form, 
determined using external calibration. 
RESULTS 

The aim of the experiment was to develop a fast and reliable 
method independent of exact stringencies for mutation detection that 



wo 9800166 



PCT/US97/20444 



■100- 



feads to high quality and high throughput in the diagnosis of genetic 
diseases. Therefore a special kind of DNA sequencing (oligo base 
extension of one mutation detection primer) was combined with the 
evaluation of the resulting mini-sequencing products by matrix-assisted 
laser desorption ionization (MALDI) mass spectrometry (MS) The 
t.me-of-flight (TOF) reflectron arrangement was chosen as a possible 
mass measurement system. To prove this hypothesis, the examination 
was perfom^ed with exon 10 of the CFTR-gene, in whi^h some mutations 
could lead to the clinical phenotype of cystic fibrosis, the most common 
1 0 monogenetic disease in the Caucasian population. 

The schematic presentation as given in Rgure 34 shows the 
expected short sequencing products with the theoretically calculated 
molecular mass of the wildtype and various mutations of exon 10 of the 
CFTR-gene (SEQ ID No. 132). T.,e short sequencing products were 
produced using either ddTTP (Figure 34A; SEQ ID Nos. 1 33-135) or 
ddCTP (Figure 34B; SEQ ID Nos. 136-139) to introduce a definitive 
sequence related stop in the nascent DNA strand. The MALDI-TOF-MS 
spectra of healthy, mutation heterozygous, and mutation homozygous 
md.v.du.ls are presented in Figure 35. All samples were confirmed by 
standard Sanger sequencing which showed no discrepancy in 
comparison to the mass spec analysis. The accuracy of the experimental 
measurements of the various molecular masses was within a range of 
m.us 21.8 and plus 87.1 dalton (Da, to the range expected. This allows 
a definrtive interpretation of the results in each case A further 
25 advantage of this procedure is the unambiguous detection of the AI507 
mutation. In the ddTTP reaction, the wildtype allele would be detected 
Whereas in the ddCTP reaction the three base pair deletion would be ' 
disclosed. 
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The method described is highly suitable for the detection of single 
point mutations or microiesions of DNA. Careful choice of the mutation 
detection primers will open the window of multiplexing and lead to a high 
throughput including high quality in genetic diagnosis without any need 
5 for exact stringencies necessary in comparable allele-specific procedures. 
Because of the uniqueness of the genetic information, the oligo base 
extension of mutation detection primer is applicable in each disease gene 
or polymorphic region in the genome like variable numbeT of tandem 
repeats (VIMTR) or other single nucleotide polymorphisms (e.g.. 
10 apolipoprotein E gene), as also desicribed here. 

EXAMPLE 8 

Detection of Polymerase Chain Reaction Products Containing 
7-Deazapurine Moieties wfth Matrix-Assisted Laser Desorption^lonization 
'nme-of-i=nght (MALDI-TOF) Mass Spectrometry 

15 MATERIALS AND METHODS 

Nucleic acid amplifications 

The following oligodeoxynucleotide primers were either 
syntiiesized according to standard phosphoamidite chemistry (Sinha, 
N.D,. siaL. (1983) Tetrahedron Let. Vol. 24, Pp. 5843-5846; Sinha, 

20 N.D., fij aL, (1 984) Nucleic Acids Res., Vol. 1 2, Pp. 4539-4557) on a 
MilliGen 7500 DNA synthesizer (Millipore, Bedford, MA, USA) in 200 
nmol scales or purchased from MWG-Biotech (Ebersberg, Germany, 
primer 3) and Biometra (Goettingen, Germany, primers 6-7). 
primer 1 : 5'-GTCACCCTCGACCTGCAG (SEQ ID NO. 1 6); 

25 primer 2: 5'-TTGTAAAACGACGGCCAGT (SEQ ID NO. 17); 
primer 3: 5'-CTTCCACCGCGATGTTGA (SEQ ID NO. 18); 
prim r 4: 5'-CAGGAAACAGCTATGAC (SEQ ID NO. 19); 
prim r 5: 5'-GTAAAACGACGGCCAGT (SEQ ID NO. 20); 
prim r 6: 5'-GTCACCCTCGACCTGCAgC (g: RiboG) (SEQ ID NO. 21); 
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primer 7: 5'-GTTGTAAAACGAGGGCCAgT (g: RiboG) (SEQ ID NO 
22); 

The 99-mer (SEQ ID No. 141 ) and 200-mer DNA strands (SEQ ID 
No. 140; modified and unnrwdified) as well as the ribo- and 
5 7-dea2a-modified 100-mer were amplified from pRFcl DNA (10 ng 
generously supplied by S. Feyerabend, University of Hamburg) in 100 
reaction volume containing 10 mmol/L KCI, 10 mmol/L (NH^),SO 20 
mmol/L Tris HCI (pH 8.8), 2 mmol/L MgSO,, (exoi-) Pseadocoaoas 
furiosus (Pfu) -Buffer, Pharmacia, Freiburg, Germany), 0.2 mmol/L each 

10 dNTP (Pharmacia, Freiburg, Germany), 1 ^ol/L of each primer and 1 
unit of exoi.)Pfu DNA polymerase (Stratagene, Heidelberg, Germany) 
For the 99.mer primers 1 and 2, for the 200-mer primers 1 and 3 and for 
the 100-mer primers 6 and 7 were used. To obtain 7-dea2apurine 
modified nucleic acids, during PCR-ampIification dATP and dGTP were 
15 replaced with 7-deaza-dATP and 7-dea2a- dGTP. TT^e reaction was 
performed in a thermal cycler (OmniGene, MWG-Biotech, Ebersberg 
Germany) using the cycle: denaturation at for 1 min., annealing at 

51 «»C for 1 min. and extension at 72^0 for 1 min. For all PCRs the 
number of reaction cycles was 30. The reaction was allowed to extend 
20 for additional 10 min. at 72«C after the last cycle. 

The 103-mer DNA strands (modified and unmodified; SEQ ID No 
245) were amplified from M13mp18 RFI DNA (100 ng, Pharmacia 
Freiburg, Germany) in 100 ^L reaction volume, using primers 4 and 5 all 
other concentrations were unchanged. The reaction was performed 
25 using the cycle: denaturation at 95oC for 1 min., annealing at 40-0 for 
1 min. and extension at 720C for 1 min. After 30 cycles for the 
unmodified and 40 cycles for the modified 103-mer respectively, the 
sampi s were incubated for additional 10 min. at 72«>C. 
Synthesis of S'-f^-PHabeled PCR-primers 
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Primers 1 and 4 were 5'-[^*-PI-labeled employing 
T4-polynucleotidldnase (Epicentre Technologies) and (k-^P)-ATP. 
(BLU/NGG/502A, Dupont, Germany) according to the protocols of the 
manufacturer. The reactions were performed substituting 10% of primer 
5 1 and 4 in PCR with the labeled primers under otherwise unchanged 
reaction-conditions. The amplified DMAs were separated by gel 
electrophoresis on a 10% polyacrylamide gel. The appropriate bands 
were excised and counted on a Packard TRI-CARB 460C-|iquid 
scintillation system (Packard, CT, USA), 
10 Primer-cleavage from ribo-modified PCR-product 

The ampfified DNA was purified using Ultrafree-MC filter units 
(30,000 NMWL), it was then redissoived in 100 /it of 0.2 mol/L NaOH 
and heated at 95 »C for 25 minutes. The solution was then acidified with 
HC1 (1 mol/L) and furtiier purified for MALDI-TOF analysis employing 
1 5 Ultrafree-MC filter units (1 0,O00 NM WL) as described below. 
PuriricatJon of amplified products 
•s. AH samples were purified and concentrated using Ultrafree-MC 
units 30000 NMWL (Millipore, Eschborn. Gemiany) according to tiie 
manufacttjrer's description. After lyophilization, amplified products were 
20 redissoived in 5 /d. (3 //L for the 200-mer) of ultrapure water. This 
analyte solution was directiy used for MALDI-TOF measurements. 
MALDI-TOF MS 

Aliquots of 0.5 fA. of analyte solution and 0.5 pL of matrix solution 
(0.7 mol/L 3-HPA and 0.07 mol/L ammonium citrate in acetonitrile/water 
25 (1:1, v/v)) were mixed on a flat metallic sample support. After drying at 
ambient temperature the sample was introduced into the mass 
specti^omet r for analysis. Th MALDI-TOF mass spectrometer used was 
a Rnnigan MAT Vision 2000 (Hnnlgan MAT. Bremen. Germany). Spectra 
were recorded in the positiv ion reflector mode with a 5 keV ion source 
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and 20 keV postacceleration. The instrument was equipped with a 
nitrogen laser (337 nm wavelength). The vacuum of the system was 
3-4.ia» hPa in the analyzer region and 1-4.10-^hPa in the source region 
Spectra of modified and unmodified ONA samples were obtained with the 
5 same relative laser power; external calibration was performed with a 

mixture of synthetic oligodeoxynucleotides (7-to 50-mer). 
- RESULTS AND DISCUSSION 

' T/^cf'' 7^ea.spurine nucleotide containing nucieic eci<is 

10 In order to demonstrate the feasibility of MALDI-TOF MS for the 

rapid, gel-free analysis of short amplified products and to investigate the 
effect of 7-deazapurine modification of nucleic acids under MALDI-TOF 
conditions, two different primer-template systems were used to 
synthesize ONA fragments. Sequences are displayed in Figures 36 and 
37. While the two single strands of the 103-mer amplified product had 
nearly equal masses (Am= 8 u), the two single strands of the 99-mer 
.differed by 526 u. Considering that 7-deaza purine nucleotide 

building blocks for chemical DMA synthesis are approximately 160 times 
more expensive than regular ones (Product Information. Glen Research 
20.. Corporation, Steriing, VA) and their application in standard fi- 

cyano-phosphoamidite chemistry Is not trivial (Product Information Glen 
Research Corporation, Steriing, VA; Schneider et aL (1 995) JSIucLAcids 
fiss.23:1570) the cost of 7-deaza purine modified primers would be very 
h-gh. Therefore, to Increase the applicability and scope of the method, 
all PCRs were perfomied using unmodified oligonucleotide primers which 
are routinely available. Substituting dATP and dGTP by c'-dATP and 
c'-dGTP in polymerase chain reaction led to products containing 
approximat ly 80% 7-d aza-purine modified nucleosides for the 99-mer 
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and 103-fner; and about 90% for the 200-mer, respectively. Table II 
shows the base composition of all PGR products. 

TABLE II: 



Base composition of tiie 99-mer, 103-mer and 200-nier PCR am«n« 
products (unmodified and 7^<.^t« ^.^y^.' PGR ampfification 




25 



^^^^^ "sense- and "antisense" strands of the 
double-sti^anded amplified product. 

indicates relative modification as percentaae of 7.ri<.»o ^. ■ 
nucleotides of total amount of purine nucleotidel. '^"""^ 



It remained to be detemriined whetiier 80-90% 7-dea2a-purine 
modification is sufficient for accurate mass spectrometer detection It 
was therefore important to determine whetiier aH purine nucleotides 
c uld be substituted during tiie enzymatic amplification step This was 
30 not tiivial since it had been shown tiiat c'-dATP cannot fully replace 
dATP in PCR if Tag DNA polymerase is employed (Se la. F and A 
Roelling (1992) Nucleic Acids Res., 20.55-61). Fortunately it was found 
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that exo(-)/ya DNA polymerase indeed could accept c'-dATP and c^-dGTP 
■n the absence of unmodified purine nucleoside triphosphates The 
incorporation was less efficient leading to a lower yield of amplified 
product (Figure 38). 

5 To verify these results, the amplications with («PJ-Jabeled primers 

were repeated. The autoradiogram (Hgure 39) cleariy shows lower yields 
for the modified PCR-products. The bands were excised from the gel and 
counted. For all amplified products the yield of the modified nucleic 
acids was about 50%, referring to the corresponding unmodified 
10 amplification product. Farther experiments showed that exo(-)DeepVe.r 
and Vent DNA polymerase were able to incorporate c'-dATP and c'-dGTP 
during PCR as well. The overall performance, however, turned out to be 
best for the exo(-)/y. DNA polymerase giving least side products during 
amplification. Using all three polymerases. It was found that such PCRs 

IS «"^P'oyingc'-dATP and c^-dGTP instead of their isosteres showed less 
s.de-reactions giving a cleaner PCR-product. Decreased occurrence of 
amplification side products may be explained by a reduction of primer 
mismatches due to a ling template which is synthesized during PCR 
Decreased melting point for DNA duplexes containing 7-dea2a-purine 
20 have been described (Mizusawa. S. ^ aL, (I 986) Nucleic Adds Res 1 4 
1319-1324). In addition to the three polymerases speblfied above (exo(-i 
Deep Vent DNA polymerase, 5Vent DNA polymerase and exo(-) (Pfu) 
DNA polymerase), it is anticipated that other polymerases, such as the 
Large Klenow fragment of E.coli DNAi,olymerase, Se<juenase. Taq DNA 
polymerase and U AmpliTaq DNA polymerase can be used. In addition 
where RNA is the template, RNA polymerases, such as the SP6 or the T7 
RNA polymerase, must be used. 
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MALDI-TOF mass spectrometry of modified and ur^modified 
amplified products. 

The 99-mer, 103-mer and 200-mer amplified products were 
analyzed by MALDl-TOF MS. Based on past experience, it was known 
5 that the degree of depurination depends on the laser energy used for 
desorption and ionization of the analyte. Since the influence of 
7-deazapurine modification on fragmentation due to depurination was to 
be investigated, all spectra were measured at the same relative laser 

10 ^"^'^Rgures 40a and 40b show the mass spectra of the modified and 
unmodified 103-mer nucleic acids. In case of the modified 103-mer. 
fragmentation causes a broad (M+H)^ signal. The maximum of the peak 
is shifted to lower masses so that the assigned mass represents a mean 
value of (M+H)* signal and signals of fragmented ions, rather than the 
15 (M+H)* signal itself. Although the modified 103-mer still contains about 
20% A and G from the oligonucleotide primers, rt shows less 
fragmentation which is featured by much more narrow and symmetnc 
ignals. Especially peak tailing on the lower mass side due to 
depurination, is substantially reduced. Hence, the difference between 
20 measured and calculated mass is strongly reduced althoug^^^^^^^ 

below the expected mass. For the unmodified sample a (M . H) s.gna. 
of 31670 was observed, which is a 97 u or 0.3% difference to the 
calculated mass. While, in case of the modified sample this mass 
difference diminished to 10 u or 0.03% (31713 u found, 31723 u 
25 calculated). These obsen^ations are verified by a significant increase .n 
n.ass resolution of the <M + H) ^ signal of the two signal strands (n/Am = 
67 as opp sed to 18 for the unmodified sample with Am = full width at 

Recaus ofth low mass difference between the 
half maximum, fwhm). Becaus «i ui 

two single strands (8 u) their individual signals were not.esolved. 
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With the results of the 99 base pair DNA fragments the effects of 
increased mass resolution for 7-deazapurine containing DNA becomes 
even more evident- The two single strands in the unmodified sample 
were not resolved even though the mass difference between the two 
5 strands of the amplified product was very high with 526 u due to 

unequal distribution of purines and pyrimidines (figure 41a). In contrast 
to this, the modified DNA showed distinct peaks for the two single 
strands (figure 41b) which demonstrates the superiority of this approach 
for the determination of molecular weights to gel eiectrophoretic methods 

10 even more profound. Although base line resolution was not obtained the 
individual masses were able to be assigned with an accuracy of 0,1%: 
Am = 27 u for the lighter (calc. mass = 30224 u) and Am = 14 u for 
the heavier strand (calc. mass = 30750 u). Again, it was found that the 
full width at half maximum was substantially decreased for the 

15 7-deazapurine containing sample. 

In case the 99-mer and 103-mer, the 7-deazapurine containing 
nucleic acids seem to give higher sensitivity despite the fact that they 
still contain about 20% unmodified purine nucleotides. To get 
comparable signal-to-noise ratio at similar intensities for the (M + H)* 

20 signals, the unmodified 99-mer required 20 laser shots in contrast to 12 
for the modified one and the 103-mer required 12 shots for the 
unmodified sample as opposed to three for the 7-deazapurine 
nucleoside-containing amplified product. 

Comparing the spectra of the modified and unmodified 200-mer 

25 amplicons, improved mass resolution was again found for the 

7-deazapurine containing sample as well as increased signal intensities 
(Figures 42A and 42B). While the signal of the single strands 
predominates in the spectrum of the modified sample the DNA-duplex 
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and dimers of the single strands gave the strongest signal for the 
unmodified sannple. 

A complete 7-deaza purine modification of nucleic acids may be 
achieved either using modified primers in PCR or cleaving the unmodified 
5 primers from the partially modified amplified product. Since 

disadvantages are associated with modified primers, as described above, 
a 100-mer was synthesized using primers with a ribo-modification. The 
primers were cleaved hydrolytically with NaOH according to a method 
developed eariler in bur laboratory (Koester, H. et aL, Z Physiol. Chem., 
10 359. 1 570-1 589). Rgures 43A and 43B display the spectra of the 
amplified product before and after primer cleavage. Figure 43b shows 
that the hydrolysis was successful: The hydrolyzed amplified product as 
well as the two released primers could be detected together with a small 
signal from residual uncleaved 100-mer. This procedure is especially 
15 useful for the MALDI-TOF analysis of very short PCR-products since the 
.share of unmodified purines originating from the primer increases with 
decreasing length of the amplified sequence. 

The remaricable properties of 7-deazapurine modified nucleic acids 
can be explained by either more effective desorption and/or ionization, 
20 increased ion stability and/or a lower denaturation energy of the double 
stranded purine modified nucleic acid. The exchange of the N-7 for a 
methyl group results in the loss of one acceptor for a hydrogen bond 
which influences the ability of the nucleic acid to form secondary 
structures due to non-Watson-Crick base pairing (Seela, F. and A. Kehne 
25 (1987) fi/oc/»e/n»io^, 26, 2232-2238.). In addition to this the aromatic 
system of 7-deazapurine has a lower electron density that weakens 
Watson-Crick base pairing resulting in a decreased melting point 
(Mizusawa, S. jtaL, (1986) Nucleic Acids Res. . 14. 1319-1324) of the 
double-strand. This effect may decrease the energy needed for 
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denaturation of the duplex in the MALDI process. These aspects as well 
as the loss of a site which probably will carry a positive charge on the 
N-7 nitrogen renders the 7-dea2apurine modified nucleic acid less polar 
and may promote the effectiveness of desorption. 
5 Because of the absence of N-7 as proton acceptor and the 

decreased polarization of the C-N bond in 7-dea2apurine nucleosides 
depurination following the mechanisms established for hydrolysis in 
solution is prevented. Although a direct correlation of reactions in 
solution and in the gas phase is problematic, less fragmentation due to 
10 depurination of the modified nucleic acids can be expected in the MALDI 
process. Depurination may either be accompanied by loss of charge 
which decreases the total yield of charged species or it may produce 
charged fragmentation products which decreases the intensity of the non 
fragmented molecular ion signal. 
15 The observation of increased sensitivity and decreased peak tailing 

of the (M + H)* signals on the lower mass side due to decreased 
fragmentation of the 7-dea2apurine containing samples indicate that the 
N-7 atom indeed is essential for the mechanism of depurination in the 
MALDI-TOF process. In conclusion. 7.dea2apurine containing nucleic 
20 acids show distinctly increased ion-stability and sensitivity under 

MALDI-TOF conditions and therefore provide for higher mass accuracy 
and mass resolution. 

EXAMPLE 9 

Solid Phase Sequencing and Mass Spectrometer Detection 
25 MATERIALS AND METHODS 

Oligonucleotides were purchased from Operon Technologies 
(Alameda, CA) in an unpurified form. Sequencing reactions were 
P rf rmed on a solid surface using reagents from the sequencing kit for 
Sequenase Version 2.0 (Amersham. Arlington H ights, Illinois). 
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Sequencing a SS-mer target 
Sequencing complex: 

SEQUENCE SEQ ID 

NO. 

5 5'-TCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA-(A«1,-3' 23 
5'"TGTACGTCACAACT-3' (PNA 16/DNA) . 24 

In order to perform solid-phase DNA sequencing, tomplate strand 
DNA1 1683 was 3'-biotinylated by terminal deoxynucieotidyl transferase. 

10 A 30 //I reaction, containing 60 pmol of DNA1 1 683, 1 .3 nmol of biotin 
14-dATP (GIBCO BRL, Grand Island, NY), 30 units of terminal transferase 
(Amersham, Arlington Heights, Illinois), and 1x reaction buffer (supplied 
with enzyme), was incubated at 37**C for 1 hour. The reaction was 
stopped by heat inactivation of the terminal transferase at 70**C for 10 

15 min. The resulting product was desalted by passing through a TE-10 
spin column (Clontech). More than one molecules of biotin- 14-dATP 
could be added to the 3'-end of DNA1 1 683. The biotinylated 
DNA11683 was incubated with 0.3 mg of Dynal streptavidin beads in 30 
//I 1x binding and washing buffer at ambient temperature for 30 min. 

20 The beads were washed twice with TE and redissolved in 30 //I TE, 1 0 //I 
aliquot (containing 0. 1 mg of beads) was used for sequencing reactions. 

The 0,1 mg beads from previous step were resuspended in a 10//! 
volume containing 2 //I of 5x Sequenase buffer (200 mM Tris-HCI, pH 
7,5, 100 mM MgCl2, and 250 mM NaCI) from the Sequenase kit and 5 

25 pmol of corresponding primer PNA 16/DNA. The annealing mixture was 
heated to 70®C and allowed to cool slowly to room temperature over a 
20-30 min time period. Then 1 //I 0.1 M dithiothreitol solution, 1 //I Mn 
buffer (0.15 M sodium isocitrate and 0.1 M MgCy, and 2 fA oi diluted 
Sequenase (3,25 units) were added. The reaction mixture was divided 
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•nto four aliquots of 3 /4 each and mixed with termination mixes (each 
contains of 3 //I of the appropriate termination mix: 32 m c7dATP 32 
m dCTP, 32 m c7dGTP, 32 dTTP and 3.2 //M of one of the four 
ddTNPs, in 50 mlVI NaCI). The reaction mixtures were incubated at 37oc 
for 2 min. After the completion of extension, the beads were 
precipitated and the supernatant was removed. The beads were washed 
twice and resuspended in TE and kept at 4«»C. 

Sequencing a 78-mer target 
Sequencing complex: 

5'-AAGATCTGACCAGGGATTCGGTTAGCGTGACTGCTGCTGCTGCTGCT 

GCTGCTGGATGATCCGACGCATCAGATCTGG-(A")„-3'(SEQ ID NO 25) 
(TNR.PLASM2) ' 

5'-CTGATGCGTCGGATCATC-3' (CM1) (SEQ ID NO. 26) 

The target TNR.PLASM2 was biotinylated and sequenced using 
procedures similar to those described in previous section (sequencing a 
39-nier target). 

Sequencing a 15-mer target with partially duplex probe 
Sequencing complex: 
*'-F-GATGATCCGACGCATCACAGCTC3' (SEQ ID No. 27) 
20 *'-TCGGTTCCAAGAGCTGTGATGCGTCGGATCATC-b-3' (SEQ ID No. 28) 
CM1B3B was immobilized on Dynabeads M280 with streptavidin 
(Dynal. Norway) by incubating 60 pmol of CM1B3B with 0.3 magnetic 
beads in 30;. 1 M NaCI and TE (1x binding and washing buffer) at room 
temperature for 30 min. The beads were washed twice with TE and 
25 redissolved in 30,il TE, 10 or 20^ aliquot (containing 0.1 or 0.2 mg of 
beads respectively) was used for sequencing reactions. 
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The duplex was formed by annealing corresponding aliquot of 
beads from previous step with 10 pmol of DF1 1a5F (or 20 pmol of 
DFl 1a5F for 0,2 mg of beads) in a9 /A volume containing 2 of 5x 
Sequenase buffer (200 mM Tris-HCI, pH 7.5, 100 mM MgCI^, and 250 
5 mM NaCI) from the Sequenase kit. The annealing mixture was heated to 
65<>C and allowed to cool slowly to 37*>C over a 20-30 min time period. 
The duplex primer was then mixed with 10 pmol of TSIO (20 pmol of 
TS10 for 0,2 mg of beads) in 1 /d volume, and the resulting mixture was 
further incubated at 37**C for 5 mIn, room temperature for 5-10 min. 
10 Then 1 //I 0.1 M dithiothreitol solution, 1 fA Mn buffer (0,15 M sodium 
isocitrate and 0.1 M MnCy, and 2 a/I of diluted Sequenase (3,25 units) 
were add^. The reaction mixture was divided into four aiiquots of 3 /j\ 

ach and mixed with termination mixes (each contains of 4 pi of the 
appropriate termination mix: 16//M dATP, 16//M dCTP, 16//M dGTP, 16 
15 pM dTTP and 1 .6 //M of one of the four ddNTPs, in 50 mM NaCI). The 
reaction mixtures were incubated at room temperature for 5 min, and 
37®Cfor5min. After the completion of extension, the beads were 
precipHated and the supernatant was removed. The beads were 
resuspended in 20 fA TE and kept at A^C. An aliquot of 2 //I (out of 20 
20 fA) from each tube was taken and mixed with 8 //I of formamide, the 

r suiting samples were denatured at 90-95*^0 for 5 min and 2 //I (out of 
lOpI total) was applied to an ALF DNA sequencer (Pharmacia, 
Piscataway, NJ) using a 10% pdlyacrylamide gel containing 7 M urea 
and 0,6x TBE. The remaining aliquot was used for MALDI-TOF MS 
25 analysis. 
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MALDI sample preparation and instrumentation 
Before MALDI analysis, the sequencing ladder loaded magnetic 
beads were washed twice using 50 mM ammonium citrate and 
i^suspended in 0.5 ^ pure water. The suspension was then loaded onto 
the sample target of the mass spectrometer and 0.5 /A of saturated 
matrix solution (3-hydroxypicolinic acid (HPA): ammonium citrate ^ i o 1 
mole ratio in 50% acetonitrile) was added. The mixture was allowed to 
dry pnor to mass spectrometer analysis. 

10 M.T r ^"'^''^ """"" spectrometer (Vision'2000, Finnigan 

MAT, Bremen, Germany) was used for analysis, 5 kV was applied in the 
.on source and 20 kV was applied for postacceleration. All spectra were 
taken m the positive ion mode and a nitrogen laser was used. Normally 
each spectrum was averaged for more than 100 shots and a standard ' 
25-point smoothing was applied. 
15 RESULTS AND DISCUSSION 

Conventional solid-phase sequencing 

In conventional sequencing methods, a primer is directly annealed 
to the template and then extended and terminated in a Sanger dideoxy 

20 "7"""^' Primer is used and the seque^^^ 

ladders are captured by streptavidin-coated magnetic beads. After 
washing, the products are eluted from the beads using EOTA and 
fomiamide. Previous findings indicated that only the annealed strand of 
a duplex is desort,ed and the immobilized strand remains on the beads 
Therefore, It is advantageous to immobilize the template and anneal the 
pnmer. After the sequencing reaction and washing, the beads with the 
immobilized template and annealed sequencing ladder can be loaded 
directly onto the mass spectrometer target and mix with matrix In 
MALDI, nly the annealed sequencing ladder will be desorbed and 
ionized, and the immobiliz d templet will remain on th target 
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A 39-mer template (SEQ ID No. 23) was first biotinylated at the 
3'-end by adding blotln-1 4-dATP with terminal transferase. More than 
one blotin-1 4-dATP molecule could be added by the enzyme. Since the 
template was immobilized and remained on the beads during MALDI, the 
number of biotin-l 4-dATP would not affect the mass spectra. A 14-mer 
primer (SEQ ID No. 24) was used for the solid-state sequencing to 
generate DNA fragments 3-27 below (SEQ ID Nos. 142-166) 
MALDI-TOF mass spectra of the four sequencing ladders are shown in 
Figure 44 and the expected theoretical values are shown in Table III. 
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TABLE III 





1 


5'-TCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA-{A^)„-3' 




2 


3'-TCAACACTGCATGT-5- 




3 


3'.ATCAACACTGCATGT-6' 


5 


4 


3'-CATCAACACTGCATGT-5' 




5 


3'-ACATCAACACTGCATGT-6' 




6 


3'-AACATCAACACTGCATGT-6' 




7 


3'-TAACATCAACACTGCATGT-5' 




8 


3'-ATAACATCAACACTGCATGT-5' 


10 


9 


3'-GATAACATCAACACTGCATGT-5' 




10 


3'-GGATAACATCAACACTGCATGT-5' 




11 


3'-CGGATAACATCAACACTGCATGT-5' 




12 


3'-CCGG ATAACATCAACACTGCATGT-5 ' 




13 


3'-CCCGGATAACATCAACACTGCATGT-5' 


15 


14 


3'-TCCCGGATAACATCAACACTGCATGT-5' 




15 


3'-GTCCCGGATAACATCAACACTGCATGT-5' 




16 


3'-CGTCCCGG ATAACATCAACACTGCATGT-5 ' 


- 


17 


3'-ACGTCCCGGATAACATCAACACTGCATGT-5' 




18 


3'-CACGTCCCGGATAACATCAACACTGCATGT-5' 


20 


19 


3'-CCACGTCCCGGATAACATCAACACTGCATGT-5' 




20 


3'-ACCACGTCCCGGATAACATCAACACTGCATGT-5' 




21 


3'-GACCACGTCCCGGATAACATCAACAeTGCATGT-5' 




22 


3'-GGACCACGTCCCGGATAACATCAACACTGCATGT.5' 




23 


3'-CGGACCACGTCCCGGATAACATCAACACTGCATGT-5' 


25 


24 


3'-CCGGACCACGTCCCGGATAACATCAACACTGCATGT-5' 




25 


3'-ACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5' 




26 


3'-GACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5' 




27 


3'-AGACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5' 
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TABLE III (Continued) 









C-reaction 


G-reaction 


T-reaction 




1. 












2. 




4223.8 


4223.8 


4223.8 


5 1 


3. 


4921 .1 








1 4. 




4309.2 






1 5* 












6. 


o434,0 










7. 








5737.8 


10 


8. 


oOol .1 










9, 




• - 


6379.2 






10. 






6704.4 






11. 




6995.6 








12. 




7284.8 






15 


13. 




7574.0 








14, 








7878.2 










8207.4 






16. 




8495.6 








17. 


8808.8 








20 


18. 




9097.0 








19. 




9386.2 








20. 


9699,4 










21. 






10027.6 






22. 






10355.8 




25 


23. 




10644.0 








24, 




10933.2 








25. 


11246.4 
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A-reaction 


C-reaction 


G-reaction 


T-reaction 


26. 






11574,6 




27. 


11886.8 









The sequencing reaction produced a relatively homogenous ladder, 
5 and the full-length sequence was determined easily. One peak around 
5150 appeared in all reactions are not identified. A possible explanation 
is that a small portion of the template formed some kind of secondary 
structure, such as a loop, which hindered sequenase extension. 
Mis-incorporation is of minor importance, since the intensity of these 

10 peaks were much lower than that of the sequencing ladders. Although 
y-deaza purines were used in the sequencing reaction, which could 
stabilize the N-glycosidic bond and prevent depurination, minor base 
losses were still observed since the primer was not substituted by 
7-deazapurines. The full length ladder, with a ddA at the 3' end, 

15 appeared in the A reaction with an apparent mass of 1 1899.8, A more 
intense peak of 12333 appeared in ail four reactions and is likely due to 
an addition of an extra nucleotide by the Sequenase enzyme. 

The same technique could be used to sequence longer DNA 
fragments. A 78-mer template containing a CTG repeat (SEQ ID No. 25) 

20 was 3'-biotinylated by adding biotin-14-dATP with terminal transferase. 
An 1 8-mer primer (SEQ ID No, 26) was annealed right outside the CTG 
repeat so that the repeat could be sequenced immediately after primer 
extension. The four reactions were .washed and analyzed by 
MALDI-TOFMS as usual. An example of the G-reaction is shown in 

25 Rgure 45 (SEQ ID Nos. 1 67-220) and the expected sequencing ladder is 
shown in Table fV with theoretical mass values for each ladder 
component. All sequencing peaks were well resolved except the last 
c mponent (theoretical value 20577.4) was indistinguishable from the 
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bacRflround. Two neighboring scuencinfl peaks (a 62-mer and a 
6^r, we« alao separated indica.n« that such se<,ueno.ng ana,vs,s 
!LT e appnca^e » .onger .en.p.a«s. Again, an addi«on o, an ex«a 
n^Le W tKe Se^uenase enzyn. was obsen,ed in this speouun,. 
Z addi^on is not .en*.a.e spoci«o and appeared in a« ,ou, reac«ons 
^ ^ i. easy to be iden«,ied. Con^ared .0 *a pHn-er pea^*e 
scuencing peatcs w«a a. n-uch .ower inunsity in *a long ten,p.=.e 



case. 
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TABLE IV Continued 



ddATP 



ddCTP 
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1 


20890.6 








i 








21194.4 1 


1 




21484.0 






I 54. 
1 55. 








21788.2 1 
22092.4 II 



10 



15 



— r--~""Sf aiiu pnming 
Duplex DNA probes With single-stranded overhang have been 
demonstrated to be able to capture specific DNA templates and also 
serve as primers for solid-state sequencing. The scheme is shown in 
Rgure 46. Stacking interactions between a duplex probe and a 
single-stranded template allow only a 5-base overhang to be sufficient for 
captunng. Based on this format, a 5' fluorescent-labeled 23-mer (5'-GAT 
GAT CCG ACG CAT CAC AGC TC-3') (SEQ ID No. 29) was annealed to 
a 3'-biotinylated 18-mer (5'-GTG ATG CGT CGG ATC ATC-3') (SEQ ID 
No. 30). leaving a 5-base overhang. A 15-mer template (5'-TCG GTT 
CCA AGA GCT-3') (SEQ ID No. 31) was captured by the duplex and 
sequencing reactions were performed by extension of the 5-base 
overhang. MALDI-TOF mass spectra of the reactions are shown in Rgure 
47A-D. All sequencing peaks were resolved although at relatively low 
.ntensities. The last peak in each reaction is due to unspecific addition of 
one nucleotide to the full length extension product by the Sequenase 
enzyme. For comparison, the same products were run on a conventional 
DNA sequencer and a stacking fluorogram of the results is shown in 
25 Figure 48. As can be seen from the Figure, the mass spectra had the 
ame pattern as the fluorogram with sequencing peaks at much lower 
intensity compared to the 23-mer primer. 
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EXAMPLE 10 
Thermo Sequenase Cyde Sequencing 
MATERIALS AND METHODS 

PCR amplification. Human leukocytic genomic DNA was used for 
5 PCR amplification. PCR primers to amplify a 209 bp fragment of the 
G-globin gene were the B2 forward primer (5'-CAT TTG CTT CTG ACA 
CAA CTG-3' SEQ ID NO. 32) and the S1 1 reverse primer (5'-CTT CTC 
TGT CTC CAC ATG C-3' SEQ ID NO, 33), Taq polymerase and lOx 
buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs 
10 from Pharmacia (Freiburg, Germany). The total reaction volume was 50 
//I including 8 pmol of each primer with approximately 200 ng of genomic 
DNA used as template and a final dNTP concentration of 200 //M, PCR 
conditions were: 5 min at 94*^C, followed by 40 cycles of 30 sec at 
94«*C, 45 sec at BS^C, 30 sec at 72<*C, and a final extension time of 2 
15 min at 72°C. The generated amplified product was purified and 
concentrated (2x) with the Qiagen 'Qiaquick' PCR purification kit 
(#28106) and stored in HjO. 

Cycle Sequencing. Sequencing ladders were generated by 

primer extension with Thermo Sequenase™ -DNA Polymerase (Amersham 
20 UFE Science, #E79000Y) under the following conditions: 7 pmol of HPLC 
purified primer (Cod5 12mer: 5'-TGC ACC TGA CTC-3' SEQ ID No, 34) 
were added to 6//I purified and concentrated amplified product (i.e. 1 24A 
of the original amplified product), 2.5 units Thermo Sequenase and 2.5 
mi Thermo Sequenase reaction buffer in a total volume of 25//I. The final 
25 nucleotide concentrations were of the appropriate ddNTP (ddATP, 

ddCTP, ddGTP or ddTTP; Pharmacia Biotech, #27-2045-01) and 210//M 
of ach dNTP (7-deaza-dATP, DCTP, 7-dea2a-GTP, dTTP; Pharmacia 
Biotech). 
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Cycling conditions were: denaturation for 4 min at 940c fo>, . 
.V 3B . 30 sec « «4.C. 30 sec a. aS-C. 30 sel ^^.J^Zl 

flnaf extension of 2 min at 72»C. 

Sample preparation and analysis by IVIAU)l-TOF MS After 

SO^ by add,»on o, 26^ „.o. Desaitina was acbieved by shaking 30^ 
Of ammonium saturated DOWEX (Huka #44485) cat.v>r, - u 

Wit. , r::rrrrrjrr 

u„«°r '° -«hed witH H,0 

un«*e supernatant was neutral, and finally put ,„ ,o mM ammonium 
c.uate for usage. After «,e cation exchange. ONA was purified and 
concentrated by ethano, precipit,«on by adding ^ 3 M ammonium 
acetate ,pH 6.5,. O.B M fllycogen „0 mg/ml. Sigma,, and „ Op, 
absolute athano. ,0 the analyte and incubated a. room temperature for , 

"I 70 ^ ed,ano, and resuspended ini ^.^ ,8 Mohm/cm H,0 water 

/-"'^'•DI-TOF MS analysis 0.35 M Of resuspended ONA was 

20 .lilrtr'"' ' "^'"^ ^'^ 3-bydroxypicolinic acid 

<^PA,. O 07 M ammonium citrate in H,0:CH,CN, on a stainless 
«eel sample target disk and allowed to air ^ry preceding spectrum 
acqu,s«,on using a Them,o Bioanalysis Vision 2000 MALDI-TOF operated 
■n reflectron mode with 5 and 20 kv on the targe, and conversion 

SB ^««"«"=»'»>^»*»'9onera,ed from eight peaks 

25 (3000-18000 Da, was used for all spectra. «9ht peaks 

RESULTS 

FIGURE 49 shows a MALDI-TOF mass spectrum of tHe se<,uencing 
tadder generated from a biological amplified product as template andT 
tamer ,5--TGC ACC TGA CTC-3-,SEO ID N0.34,, se<.uencing prilr 
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The peaks resulting from depurinatlons and peaks which are not related 
to the sequence are marked by an asterisk. MALDI-TOF MS 
measurements were taken on a reflectron TOF MS. A.) Sequencing 
ladder stopped with ddATP; B.) Sequencing ladder stopped with ddCTP; 
5 C.) Sequencing ladder stopped with ddGTP; D.) Sequencing ladder 
stopped with ddTtP. 

FIGURE 50 shows a schematic representation of the sequencing 
ladder generated in Rg. 49 with the corresponding calculated molecular 
masses up to 40 bases after the primer (SEQ ID Nos 221-260). For the 
10 calculation the following masses were used: 3581.4 Da for the primer, 
312.2 Da for 7-deaza-dATP, 304.2 Da for dTTP, 289.2 Da for dCTP and 
328.2 Da for 7-deaza-dGTP. 

FIGURE 51 shows the sequence of the amplified 209bp amplified 
product within the G-globin gene (SEQ ID No. 261), which was used as a 
1 5 template for sequencing. The sequences of the appropriate PGR primer 
and the location of the 12mer sequencing primer is also shown. This 
sequence represents a homozygote mutant at the position 4 after the 

primer. In a wildtype sequence this T would be replaced by an A. 

EXAMPLE 11 

Mijf?if ioc t/^^'r'^ "^"^ ''""'"^•^ Oligo Base Extension (PROBE) and 
MALDI-TOF Mass Spectrometry 

SUMMARY 

The method uses a single detection primer followed by an 
oligonucleotide extension step to give products differing in length by a 
25 number of bases specific for the number of repeat units or for second 
site mutations within the repeated region, which can be easily resolved 
by MALDI-TOF mass spectrom try. The method is demonstrated using 
as a model system the AluVpA polymorphism in intton 5 of the 
interferon-a r ceptor gene located on human chromosome 21. and the 
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poly T tract of the splice acceptor site of intron 8 from the CFTR gene 
located on human chromosome 7. 
MATERIALS AND METHODS 

Genomic DNA was obtained from 1 8 unrelated individuals and one 
5 family including of a mother, father, and three children. The repeated 
region was evaluated conventionally by denaturing gel electrophoresis 
and results obtained were confimied by standard Sanger sequencing 

The primers for PGR amplification (8 pmol each) were 
IFNAR-IVS5-5': (5'-TGC TTA CTT AAC CCA GTG TG-3'SEQ ID NO 35, 
10 and IFNAR-IVS5.3'.2: (5'-CAC ACT ATG TAA TAC TAT GC-3' SEQ ID 
Na36) for a part of the Intron 5 of the interferon-a receptor gene, and 
CFEx9-F:(5'.GAA AAT ATC TGA CAA ACT CAT C-3' SEQ ID. N0.37) 
(5 -biotinylated) and CFEx9-R:(5'-CAT GGA CAC CAA ATT AAG 
TO-3'SEQ ID. M0.38) for CFTR axon 9 with flanking intron sequences 
Of the CFTR gene. Taq-polymerase Including lOx buffer were purchased 
from Boehringer-Mannheim and dNTPs were obtained from Pharmacia 
The total reaction volume was 50 //I . PCR conditions were 5 min at 
94«'C followed by 40 cycles of: 1 min at 940C, 45 sec at 530C, and 30 
sec at 72«»C, and a final extension time of 5 min at 72 «C. 
20 Amplification products were purified using Qiageri's PCR 

purification kit (No.28106) according to manufacturer's Instructions 
Punfied products were eluted from the column in 50 pi TE-buffer (lOmM 
Tris-HCI, 1 mM EDTA, pH 7,5). 

A) Primer oligo base extension reaction (thermo cycling method 
CyclePROBE was performed with 5 pmol appropriate detection 
Primer (IFN:5'-TGA GAC TCT GTC TC-3'SEQ ID. N0.39) in a total 
V lume of 25 lA including I pmol purified template, 2 units 
Th rmos quenase (Amersham Ufe Science, Cat. #E79000Y) 2 5^ 
Th rmosequenase buffer. 25 MTiol of ach deoxynucleotide 
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(7-dea2a-dATP, dTTP, and in some experiments extra dCTP) and 100 
/miol of dideoxyguanine and In some experiments additional ddCTP. 
Cycling conditions: initial denaturatlon 94*'C for 5 min followed by 30 
cycles with 44«»C annealing temperature for 30 sec and 55<»C extension 
5 temperature for 1 min. 

Primer oligo base extension reaction (isottiermai method) 
10//I aliquots of the purified double-stranded amplified product 
(~3 pmol) were transferred to a streptavidin-coated microliter plate well 
< ~ 1 6 pmol capacity per 50 fA volume; No. 1 645684 
10 Boehringer-Mannheim), followed by addition of 10 //I incubation buffer 
(80 mM sodium phosphate, 400 mM NaCI. 0.4% Tween 20. pH 7.5) and 
30 lA water. After incubation for 1 hour at room temperature, the wells 
were washed three times with 200 fA washing buffer A (40 mM Tris, 1 
mM EDTA, 50 mM NaQ, 0.1% Tween 20, pH 8.8) and incubated with 
15 100 of 50 mM NaOH for 3 min to denature the double-stranded ONA. 
Finally, the wells were washed three times with 200 fA 70 mM 
ammonium citrate solution. 

The annealing of 100 pmol detection primer (CFpT: 5'-TTC CCC 
AAA TCC CTG-3' SEQ ID NO, 40) was performed in 50 //I annealing 
20 buffer (50 mM ammonium phosphate buffer, pH 7.0 and 100 mM 

ammonium chloride) at 65«C for 2 min, at 37«C for 10 min, and at room 
temperature for 10 min. The wells were washed three times with 200 /A 
washing buffer B (40 mM Tris, 1 mM EDTA, 50 mM NH4CI, 0.1% Tween 
20, pH 8.8) and once in 200iErf TE buffer. The extension reaction was 
performed using some components of the DNA sequencing kit from USB 
(No. 70770) and dNTPs or ddlSlTPs from Pharmacia. Total reaction 
volume was 45 fA. containing of 21 fA water. 6 fA Sequenase-buffer, 3 fA 
100 mM DTT solution, 50 ^ol of 7-deara-dATP, 20 /miol ddCTP, 5.5 //| 
glycer I enzyme dilution buff r, 0.25 fA Sequ nase 2.0, and 0.25 fA 
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pyrophosphatase. The reaction was pipetted on ice and incubated for 1 5 
mm at room temperature and for 5 min at 37oc. Finally^the wells were 
washed three times with 200 //I washing buffer B. 
^ The extended primer was denatured from the template strand bv 

5 heating at 80OC for 10 min in 50^1 Of a 50 mM ammonium hydroxide 
solution. ■ '""^'"^ 

- (10 mg/ml water, S.gma, Cat.#G1765,, and 110^ absolute ethano. were 
added to the supernatant and incubated for 1 hour at room temperature 
After centrifugation at 1 3.000 g for 10 min the pellet was washed in 
70% etiianol and resuspended in 1 //I 18 Mohm/cm H,0 water. 

Sample preparation was performed by mixing 0.6 pi of matrix 
solution (0.7 M S-hydroxypicolinic acid, 0.07 M dibasic ammonium 
citrate in 1:1 H,0:CH3CN) witi. 0.3^ of resuspended DNA/glycogen 
pellet on a sample target and allowed to air dry. Up to 20 samples were 
spotted on a probe target disk for introduction into the source region of a 
TTiermo B.oanalysis (formeriy Finnigan) Visions 2000 MALDI-TOF 
operated in reflectron mode witfi 5 and 20 kV on the target and 

20 "r/Tir ■^'^--ti- rnolecular mass 

20 :.<M,(calc)) were calculated from atomic compositions; reported 

xperimental (M,(exp,) values are tiiose of ti,e singly-protonated form 
d termined using external calibration. 
RESULTS 

The aim of the experiments was to develop a fast and reliable 
method for the exact determination of tiie number of repeat units in 
m.crosatellites or the length of a mononucleotide stretch Including tiie 
potential to detect second site mutations witi,in ti,e polymorphic r gion 
Ther fore, a special kind of DNA sequencing (primer oHgo base 
xt nsl n. PROBE) was combined witi, ti,e evaluation of the resulting 
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products by matrix-assisted laser desorption ionization (MALOl) mass 
spectrometry (MS). The time-of-flight (TOF) reflectron arrangement was 
chosen-as a possible mass measurement system. As an initial feasibility 
study, an examination was performed first on an AluVpA repeat 
B polymorphism located in intron 5 of the human interferon-a receptor gene 
(cyclePROBE reaction) and second on the poly T tract located in intron 8 
of the human CFTR gene (isothermal PROBE reaction). 

A schematic presentation of the cyclePROBE experiment for the 
AluVpA repeat polymorphism is given in Figure 52. The extension of tfie 
10 antisense strand (SEQ ID No. 262) was performed with the sense strand 
serving as the template. The detection primer is underiined. In a family 
study co-dominant segregation of the various alleles could be 
demonstrated by the elOctrophoretic procedure as well as by the 
cyclePROBE method followed by mass spec analysis (Figure 53). Those 
15 alleles of the mother and child 2, for which direct electrophoresis of the 
amplified product indicated one of the two copies to have 1 3 repeat 
units,were measured using cyclePROBE to have instead only 1 1 units 
using ddG as terminator. The replacement of ddG by ddC resulted in a 
further unexpected short allele with a molecular mass of approximately 
20 1 1 650 in the DNA of the mother and child 2 (Rgure 54). Sequence 

analysis verified this presence of two second site mutations in the allele 
with 13 repeat units. The first is a C to T transition in the third repeat 
unit and the second mutation is a T to G transversion in the ninth repeat 
unit. Examination of 28 unrelated individuals shows that the 13 unit 
25 allele is spliced into a nonnal allele and a truncated allele using 

cyclePROBE. Statistical evaluation shows that the polymorphism is in 
Hardy-W inberg quilibrium for both methods, however, using 
cyclePROBE as detection method the polymorphism information content 
is increased to 0.734. 
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PROBE was also used as an .•sothemilc method for the detection of 
the three common alleles at the intron 8 splice acceptor site of the CFTT? 
gene (SEQ ID No. 263). Rgure 55 shows a schematic presentation of 
the expected diagnostic products {SEQ ID Nos. 264-266) with the 
5 theoretical mass values. The reaction was also performed in the 
antisense direction. 

Rgure 56 demonstrates that all three common alleles (TS T7 and 
T9, respectively) at this locus could be reliably disclosed by this meihod 
Reference to Rgure 56 indicates that mass accuracy and precision with ' 
10 the reflectron time of flight used in this study ranged from 00.4% with 
a relative standard deviation of 0.13%. This corresponds to far better 
than single base accuracy for the up to <90-mer diagnostic products 
generated in the IFNAR system. Such high analytical sensitivity is 
sufficient to detect single or multiple insertion/deletion mutations within 
the repeat unit or its flanking regions, which would induce > 1 % mass 
shifts in a 90-mer. This is analogous to the Rgure 56 polyT tract 
analysis. Other mutations (i.e. an A to T or a T to A mutation within the 
tFNAR gene AST repeat) which do not cause premature product 
termination are not detectable using any dNTP/ddNTP combination with 
PROBE and low performance MS instrumentation; a 9 Da shift in a 
90-mer corresponds to a 0.03% mass shift. Achieving the accuracy and 
precision required to detect such minor mass shifts has been 
demonstrated witii higher performance instrumentation such as Fourier 
transform (FT)MS, for which single Da accuracy is obtained up to 
25 100-mers. Further, tandem FTMS, in which a mass shifted fragment can 
be isolated witf^in tiie instrument and dissociated to generate sequence 
specific fragments, has been demonsti^ated to locate poim mutations to 
the bas in comparably sized products. Thus the combination of PROBE 
witfi high r perffomnance instrumentation will have an analytical 
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EXAMPLE 12 

MATERIALS AND METHODS 

PCR amp/ff/cation. 

Human leukocytic genomic DNA from 100 annn»«, 

apo E «en3 were de,i„e«ed acconiin, .„ *e punished sequel ,o« 
HK a aU. (, 985) i.aoUaiSm. 262:6240-6247. (forwardT 

polymerase and lOx botfer were parchesed fro,„ Boeh " 
-™ar,v, and d.^ .reiror ZTlrr 

siempiate. Solutions were heated to 80 <»C before* 
a.<..o„ „, ,u pcy^rase; PC« co„d«o„s were, a ^„ a't;!* 

Trelr," °r " - - 30 se'c a. 

'^^ and a final extension time of 2 min at 720C 

Co, and Rsa. and reaction buffer L were purchased fronT 
Boehnnger-Mannhelm, and Hhal from Phamiacia (Freiburo r 
For CM alone and simultaneous «oMUa. dl« stiT to '; 
products were diluted wHb , 5 M water and l^^'Z'^^^ 
-er . a.er addlt^n ot ,0 units „t approp Jte T^ZZ^!: 
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the samples were incubated for 60 min at 37oc. The procedure for 
s.n,ultaneous Hhal/Rsal digestlor, required first digestion by Rsal in buffer 

for one hour followed by addition of NaCI (50 niM end concentration) 
and Hhal, and additional incubation for one hour. 20 pL of the restriction 
5 d'9est were analyzed on a 12% polyacrylamide gel as described 
elsewhere (Hixson (1990) J^^ieidJBes. 31:545-548). Recognition 
sequences of Rsal and Cfol (Hhal) are GT/AC and GCG/C, respectively 
masses of expected digestion fragments from the 252-mer amplified 

10 ZT hT Simultaneous double digest with Cfol (or 

lO Hhal) and Rsal are given in Table V. 

T/iermo-PROBE. 

PGR amplification was performed as described above, but with 
products purified with the Qiagen' Qiaquick' kit to remove 
unincorporated primers. Multiplex Them,o-PROBE was performed with 
35 ^ amplified product and 8 pmol each of the codon 112 (5'.GCG GAC 
ATG GAG GAC GTG-3' SEQ ID. N0.43) and 158 (5'-GAT GCC GAT 
GAC CTG CAG AAG-3' SEQ ID. N0.44) detection primers in 20 pi 
-ncluding ~1 pmol purified biotinylated antisense template immobilized 
on s^eptavidin coated magnetic beads, 2.5 units Thermosequenase 
Thermosequenase buffer, 50 pM of each dNTP and 200 pM of ' 
ddXTP, with the base identity of N and X as described in the text 
Cycling conditions were: denaturation (94«C, 30 sec) followed by 30 
cycles at 94<>C (10 min) and 60«>C (45 sec). 

Samp/e preparation and analysis by MALDI-TOF MS. 
25 ''°^P^«*=JP«tation (Stults era/., (1991) BaeidCom^^ 

Seec^S: 359-363) of both digests and PROBE products, 5;^ 3M 
ammonium acetate (pH 6.5), 0.5^ glycogen (10 mg/ml, Sigma), and 1 10 

absolute than I were added to 50 ^ of the analyte solutions and 
stored for 1 hour at room temperature. After 10 min centrifugation at 



wo 98/20166 



PCT/US97/20444 



-137- 



13.000 X g the pellet was washed in 70% ethanol and resuspended in 1 
//I 18 Mohm/cm H^O water. Where noted in the text, additional desalting 
was achieved by shaking 10-20//L of ammonium saturated DOWEX 
(Fluka #44485) cation exchange beads in 40//L of analyte. The beads, 
5 purchased in the protonated form, were pre-treated with three 5 min 
spin-decant steps in 2M NH4OH, followed with H^O and 10 mM 
ammonium citrate. 

O.SbpL of resuspended DNA was mixed with 0,35-1. 3//L matrix 
solutions (Wu fit sL (1 993) Rapid Commun. Mas^. R poo^ — ^ 7:142-146) 
10 0.7 M 3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1:1 
HjOiCHgCN) on a stainless steel sample target disk and allowed to air dry 
preceding spectrum acquisition using a Themio Bioanalysis Vision 2000 
MALOI-TOF operated in reflectron mode with 5 and 20 kV on the target 
and conversion dynode, respectively. Theoretical average molecular 
IB masses (M,(calc)) of the fragments were calculated from atomic 

compositions; the mass of a proton (1.08 Da) is subtracted from raw 
data values in reporting experimental molecular masses (M,(exp)) as 
neutral basis. An external calibration generated from eight peaks 
(3000-18000 Da) was applied to all spectra. 
20 RESULTS 

Digestion \Mth Cfol alone. 

The inset to Figure 57a shows a 12% polyacrylamide gel 
electrophoretic separation of an e3/e3 genotype after digestion of the 
252 bp apo E amplified product with Cfol. Comparison of the 
I ctrophoretic bands with a molecular weight ladder shows the cutting 
pattern to be as mostly as expected (Table V) for the €3/e3 genotype. 
Differences are that the faint band at approximately 25 bp is not 
xpected, and the smallest fragments are not observed. The 
accompanying mass spectrum of precipitated digest products shows a 
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s.m.lar pattern, albeit at higher resolution. Comparison with Table V 
shows that the observed masses are consistent with those of single 
stranded ONA; the combination of an acidic matrix environment (3-HPA 
PK. 3) and the absorption of thermal energy via Interactions with the 337 
5 nm absorbing 3-HPA upon Ionization Is known to denature short 
stretches of dsDNA under normal MALDI conditions (Tang, K gt aL, 
(1994) Rapid Common Mass Spectrom 8:183-186). 

The approximately 25-mers, unresolved witii electrophoresis are 

(7427 Da) of these may represent a doubly charged ion from the 14 8 
kDa fragments (m = 14850. z = 2; m/z = 7425). the 6715 and 7153 
Da fragments could result from PCR artifacts or primer impurities- all 
^ree peaks are not observed when amplified products are purified with 
Q.agen purification kits prior to digestion. The Table V 8871 Da 29-mer 
sense strand 3'-tem,lnal fragment Is not observed; ti.e species detected 
at 9186 Da Is consistent with the addition of an extra base (9187 - 8871 
- 316. consistent witi. A) by the Taq-polymerase during PCR 
amplification (Hu, G et aL (1 993) DNA and Cell Biol 1 2:763-770) The 
-nd-vdual single strands of each double strand witi, <35 bases (11 kDa) 
are resolved as single peaks; ti,e 48-base single strands (M,(calc) ,4845 
and 14858). however, are observed as an unresolved single peak at 
14850 Da. Separating tiiese Into single peaks would require a mass 
r sok^tion (m/^m. ti.e ratio of ti,e mass to the peak widti, at half height) 
25 "-^^ - -gnltude greater ti,an what Is 

~utine wrth the standard reflectron tlme-of-f,ight instrumentation used In 
th.s study; resolving such small mass differences witi, high perfom^ance 
.nstrumentation such as Fourier transform MS. which provides up to 
^ree ord of magnitude higher resolution In ti,ls mass range, has been 
demonstrated. The 91-mer single strands (M,(calc)27849 and 28436) 
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are also not resolved, even though this requires a resolution of only <50. 
The dramatic decrease In peak quality at higher masses Is due to 
metastable fragmentation (i.e. depurination) resulting from excess internal 
energy absorbed during and subsequent to laser Irradiation. 
5 Simultaneous digestion with Cfol and Rsal. 

Figure 57b (inset) shows a 12% polyacrylamide gel elec^ophoresls 
separation of f 3/e3 double digest products, with bands consistent with 
dsDNA with 24, 31, 36, 48, and 55 base pairs, but not for the smaller 
fragments. Although more peaks are generated (Table V) than with Cfol 
10 alone, the corresponding mass spectrum is more easily Interpreted and 
reproducible since all fragments contain <60 bases, a size range far 
more appropriate for MALDI-MS if reasonably accurate values (e.g.. 
0.1%) are desired. For fragments In this mass range, the mass 
measuring accuracy using extemal calibration is -0.1% (i.e. <^10 Da at 
15 10 kDa). Significant depurination (indicated In Figure by asterisk) Is 

bserved for all peaks above 10 kDa, but even the largest peak at 17171 
Da Is cleariy resolved from Its depurination peak so that an accurate M, 
can be measured. Although molar concentrations of digest products 
should be identical, some discrimination against those fragments with 
20 ^1 1 bases Is observed, probably due to their loss in the 

ethanol/glycogen precipitation step. The quality of MS results from 
simultaneous digestion with Cfol (or Hhal) and Rsal is superior to those 
with Cfol (or Hhal) alone, since the smaller fragments generated are good 
for higher mass accuracy measurements, and with all genotypes there Is 
25 no possibility for dimer peaks overiapping with high mass diagnostic 
peaks. Since digestion by Rsal/Cfol and Rsal/Hhal produce the same 
restriction fragments but the fonner may be performed as a simultaneous 
dig st sine th ir buffer requirements are the same, this enzyme mixture 
was used for all subsequent genotyping by restriction digest protocols. 
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Table V 

Mass and Copy Number of Expected Restriction Digest Products 
Table Va Cfol Digestion" 




Table Vb. Cfol/Rsal Digestion" 



( + ) (-) 
3428, 4025 
5283, 5880 



5781, 5999 



11279, 11627 



14845, 14858 
18269, 18848 



e2/e2 



e2/e3 e2/e4 



1 

2 



eS/eS 
2 



e3/e4 
2 
1 



e4/e4 
2 
2 



•Cfol Invariant fragment masses: 1848, 2177, 2186, 2435. 4924, 5004, 
5412, 5750, 8871. 9628 Da. 
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"Cfol/Rsal Invariant fragment masses: 1848, 2177, 2186, 2436, 4924 
5004, 5412. 5750, 6745, 7510, 8871, 9628, 16240, 17175 Da. 



Table VI 



ddT M, (Calc) 



ddT M, (Exp) 



ddC M, (Calc) 



ddC M, (Exp) 



e2/e2 



•5918, '6768 



•6536, '•7387 



e2/e3 



•5918, "6768, 
*7965 



5919, 6769. 
7967 



•6536, "6753, 
'7387 



6542. 6752. 
7393 



e2/e4 



'5918, "6768, 
•7965, •8970 



•5903, '6536, 
'6753, V387 



e3/e3 



•6918, "7965 



5918, 7966 



•6536, '6753 



6542, 6756 



e3/ 4 



•5918. "7965. 
•8970 



5914, 7959. 
8965 



•5903, '6536. 
'6753 



5898. 6533. 
6747 



10 



15 



4/e4 



7965. '8970 



5903. '6753 



5900. 6752 



7966. 8969 

■hrom codon 112 detection primer (unextended 5629.7 Da)," 
'Fr m codon 158 detection primer (unextended 6480.3 Da). 
Dashed lines: this genotype not available from the analyzed pool of IpO 
patients. 

Hgure 58a-c shows the ApoE c3/e3 genotype after digestion with 
Cfol and a variety of precipitation schemes; equal volume aliquots of the 
same amplified product were used for each. The sample treated with a 
single precipitation (Figure 58a) from an ammonium acetate/ethanol/gly- 
cogen -solution results in a mass spectrum -characterized by broad peaks, 
specially at high mass. The masses for intense peaks at 5.4, 10.7, and 
14.9 kDa are 26 Da (0.5%), 61 Da (0.6%). and 45 Da (0.3%) Da higher, 
r spectiv ly. than the expected values; the resolution (the ratio of a peak 
width at half its total intensity to the measured mass of the peak) for 
each of these is -50, and d creas s with increasing mass. Such 
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observations are consistent with a high level of nonvolatile cation 
adduction; for the 1 0.8 kDa fragment, the observed mass shift is 
consistent with a greater than unit ratio of adducted.nonadducted 

molecular ions. 

5 MS peaks from a sample redissolved and precipitated a second 

f me are far sharper (Rgure 58b), with resolution values nearly double 
those of the corresponding Rgure 58a peaks. Mass accuracy values are 
also considerably improved; each is within 0.07% of Its respective 
calculated values, close to the Independently determined instrumental 
10 limits for DNA measurement using 3-HPA as a matrix. Single (not 
shown) and double (Rgure 58C) precipitations with isopropyl alcohol 
(IPA) instead of ethanol result in resolution and mass accuracy values 
comparable to those for corresponding ethanol precipitations but 
enhanced levels of dimerization are observed, again potentially confusing 
measurements when such dimers overlap with higher mass "diagnostics- 
monomers presem in the solution. EtOH/ammonium acetate precipitation 
with glycogen as a nucleation agent results in nearly quantitative 
recovery of fragments except for the 7-mers, serving as a simultaneous 
concentration and desalting step prior to MS detection. Precipitation 
from the same EtOH/ammonium acetate solutions in the absence of 
glycogen results in far poorer recovery, especially at low mass. 

The results indicate that to obtain accurate (M,(exp) values after 
either IPA and EtOH precipitations, a second precipitation is necessary 
to maintain high mass accuracy and resolution. 
25 The ratio of matrix.digest product also affects spectral quality- 

severe suppression of higher mass fragments (not shown) observed with 
1:1 volume matrix: digest product (redissolved in 1//L) is alleviated by 
"Sing a 3 - 5 fold volume excess of matrix. 
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Apo £ genotyping by enzymatic digestion. Codon 112 and 1 58 
polymorphisms fall within Cfol (but not Rsal) recognition sequences. In 
the 252 bp amplified product studied here, invariant (i.e. cut in all 
genotypes) sites cause cuts after bases 31, 47, 138, 156, 239, and 
5 246. The cutting site after base 66 is only present for €A, while that 
after base 204 is present in c3 and €4; the c2 genotype is cut at neither 
of these sites. These differences in the restriction pattern can be 
demonstrated as variations in mass spectra. Rgure 59 shows mass 
spectra from several ApoE genotypes available from a pool of 100 

10 patients (Braun, A et ah, (1 992) Hum. Genet. 89:401 -406). Vertical 
dashed lines are drawn through those masses corresponding to the 
expected Table V diagnostic fragments; other labeled fragments are 
invariant. Refemng to Table V, note that a fragment is only considered 
"invariant" if it is present In duplicate copies for a given allele; to satisfy 

15 this requirement, such a fragment must be generated in eaclj of the €2m 
. €3, and e4 alleles. 

The spectrum in Rgure 59a contains all of the expected invariant 
fragments above 3 kDa, as well as diagnostic peaks at 3428 and 4021 
(both weak), 11276 and 11627 (both intense), 14845, 18271, and 

20 18865 Da. The spectrum In Rgure 59b is nearly identical except that the 
pair of peaks at 18 kDa is not detected, and the relative peak intensities, 
most notably among the 11-18 kDa fragments, are different. The 
spectrum in Figure 59c also has no 18 kDa fragments, but instead has 
new low intensity peaks between 5-6 kDa. The intensity ratios for 

25 fragments above 9 kDa are similar to those of Rgure 59b except for a 
relatively lower 1 1 kDa fragment pair. Rgure 59d, which again contains 
the 5-6 kDa cluster of peaks, is the only spectrum with no 1 1 kDa 
fragm nts, and lik the previous two also has no 18 kDa fragment. 
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Despite the myriad of peaks in each spectrum 
d.a9no«.c peaks. Due to *e Bn^ed .^Us^ „ ^ ^MJ^^ro^ 

are «,ose based upon *e presenoe or absence of *e four <.,ag„o«ic 
*«9mente between 5.2 and 6.0 KDa characteristic of «,e .4 .I^l 
-ese *^„»«. „,3rtv ovefap with severe, invanj^r T 
' ou„d Here, .bat .be Oa d..nosbc ,ra,„„„, Zt 2 . " 

depunnabon pea. fron, tbe 54,2 Oa .nvanan. fras„«n.. and I s' 8, Da 
10 d.a3„os«c pea. ,s nor.a..v no. co.p,ete.v «so,ved fron, tbe ^760 0^ 
.nvanant fra^-nent. ■n,us. disbnpulsb.n, between an .2,.4 a ".^.3 or 
^^eer, an «/.4 and an .3/.3 a„e,e. re,lee upon «.e presence ^ 
absence of «« 5880 and 5989 Da fragn^nts. Eacb of tbeae is present 
•n Haures 69c and 59d, but not in 59a or 59b. 
" The genotype of each of fte patiente In Hgure 59 can b. ^ 

rapidiv i-enbfied by reference .0 the flowchart in Rgure r L . „, 

jpn . Htesence of the unresolved 14 e kn^ 

^ *«9ments„ inconsistent wi* homozygous .2. butleaves four 

Po^^^ities ,.2/.3. .2,.4. ^,.3. .3/.4,. Of *ese o„.y .2/.3 and .2/.4 
are cons-sten. w«h the ,8 .Da pea..; the iac. of peaics at 5283. 5879 
577a ana 5998. Da indicate «,a. tbe Rgure 59a san,.e is .2/.3. Osi^g 

^t<Z h : '° ^" ■•«'-««ca«ons by 

th» method have been consistent with, and in many cases mor. •. 
■oterpreted *an. those attained via oonventiona, meC Z " 
assignment can be further confirnied by assuring th.t f 
™«os are con.s.en. w«h «.e c.y nulersTf^r 
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the 1 4.8 kOa fragments are of lower intensity than those at 1 6- 1 7 kDa 
In Figure 59a, but the opposite is seen in Figures 59b-d. This is as 
expected, since in the latter three genotypes the 14.8 kDa fragments are 
present in duplicate, but the first is a heterozygote containing €2, so that 
5 half of the amplified products do not contribute to the 14,8 kDa signal 
Likewise, comparison of the 1 1 kDa fragment intensify to those at 9.6 
and 14.8 kDa indicate that this fragment is double, double, single, and 
zero copy in Figures 59a, d, respectively. These data confirm that 
MALDI can perform in a semi-quantitative way under these conditions. 
10 Apoe genotyping by Primer Oligo Base Extension (PROBE). The 

PROBE reaction was also tested as a means of simultaneous detection of 
the codon 112 and 158 polymorphisms. A detection primer is annealed 
to a single-stranded PCR-amplified template so that its 3' terminus is just 
downstream of the variable site. Extension of this primer by a DNA 
15 polymerase in the presence of three dNTPs and one ddXTP (that is not 
present as a dNTP) results in products whose length and mass depend 
upon the identity of the polymorphic base. Unlike standard Sanger type 
sequencing, in which a particular base-specific tube contains -99% dXTP 
and -1% ddXTP, the PROBE mixture contains 100% of a particular 
20 ddXTP combined with the other three dNTPs. Thus with PROBE a full 
stop of all detection primers is achieved after the first base 
complementary to the ddXTP is reached. 

For the €2/63 genotype, the PROBE reaction (mixture of ddTTP, 
dATP, dCTP, dGTP) causes a M,(exp) shift of the codon 1 12 primer to 
25 5919 Da, and of the codon 158 primer to 6769 and 7967 Da (Table VI); 
a pair of extension products results from the single codon 1 58 primer 
because the €2J€3 genotype Is h ter zygous at this position. Thre 
xt nsion products (one from codon 1 58, two from 112) are also 
bserved from the heterozygote c3/€4 (Hgure 61c and Table VI), while 
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only two products (one from each primer) are observed from the Figure 
61b (f3/e3} and Figure 59d (€4/e4) homozygote alleles. Referring to 
Table VI, each of the available alleles result in all expected ddT reaction 
product masses within 0. 1 % of the theoretical mass, and thus each is 
5 unambiguously characterized by this data alone. Further configuration of 
the allele Identities may be obtained by repeating the reaction with 
ddCTP (plus dATP, dTTP, dGTP); these results, summarized also In Table 
VI, unambiguously confirm the ddT results. 

Appropriateness of the methods. Comparison of Figures 59 
10 (restriction digestion) and 61 (PROBE) indicates that the PROBE method 
provides far more easily interpreted spectra for the multiplex analysis of 
codon 1 1 2 and 1 58 polymorphisms than does the restriction digest 
analysis. While the digests generate up to -25 peaks per mass spectrum 
and in some case diagnostic fragments overtapping with invariant 
1 5 fragments, the PROBE reaction generates a maximum of only two peaks 
per detection primer (i.e. polymorphism). Automated peak detection, 
spectrum analysis, and allele identification would cleariy be far more 
straightforward for the latter. Spectra for highly multiplexed PROBE, in 
Which several polymorphic sites from the same or different amplified 
20 products are measured from one tube, are also potentially simple to 

analyze. Underscoring its flexibility, PROBE data analysis can be further 
simplified by judicious a priori choice of primer lengths, which can be 
designed so that no primers or products can overiap in mass. 

Thus while PROBE is the method of choice for large scale clinical 
25 testing of previously well characterized polymorphic sites, the restriction 
digest analysis as described here is Ideally suited to screening for new 
mutations. Th identity of ach of the two polymorphisms discuss d in 
th.s study affects the fragment pattern; if this is the only informati n 
used, then the MS detection is a faster alternative to conventional 
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electrophoretic separation of restriction fragment length polymorphism 
products. The exact measurement of fragment M, values can also give 
information on about sites completely remote from the enzyme 
recognition site since other single point mutations necessarily alter the 
5 mass of each of the single strands of the double stranded fragment 
containing the mutation. The 252 bp amplified product could also 
contain allelic variants resulting in, for example, previously described 
Glyl27 Asp (Weisgraber, KH et al.. (1984) J^CIiiiJnyest, 23:1 024- 
1033), Argl36Ser (Wardell, MR et aL. (1987) i_C!iiLjnyest 80:483- 
10 490), Argl42Cys (Horie, Y et aL. (1 992) J. Biol. Chem 2§2:1 962-1 968), 
Arg145Cys (Rail SC Jr et SL, (1982) Proc. Natl- Anad. Sci. U a 
79:4696-4700), Lysl46Glu (Mann, WAgtaL (1995) J. Clin. Invest 
96:1 100-1107), or Lysl46Gln (Smit, M et aL, (1990) JJJEid.Res, 31:45. 
53) substitutions. The G-^A base substitution which codes for the 
15 Glyl 27 Asp amino acid substitution would result in a -1 6 Da shift in the 
sense strand, and in a + 15 Da (C-T) shift in the antisense strand, but 
not in a change In the restriction pattern. Such a minor change would be 
virtually invisible by electrophoresis; however, with accurate mass 
determination the substitution could be detected; the invariant 55-mer 
20 fragment at 1 6240 (sense) and 1 71 75 Da would shift to 1 6224 and 

17190 Da, respectively. Obtaining the mass accuracy required to detect 
such minor mass shifts using current MALDI-TOF instrumentation, even 
with internal calibration, is not routine since minor unresolved adducts 
and/or pooriy defined peaks limit the ability for accurate mass calling. 
25 With high performance electrospray ionization Fourier transform (ESI- 
FTMS) single Da accuracy has been achieved with synthetic 

ligonucleotides (Uttie, DP etaL. (1995) Proc. Natl. Acad, li « ,a 
92:2318-2322) up to 100-m rs (Uttle, DP etaL, (1994) J. Am. Chem 
Soc^ 116:4893-4897), and similar results have recentiy been achieved 
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EXAMPLE 13 

A Metfiod for Mass Spectrometric Detection of DMA Fr«»^^ * 
5 Associated With Telomerase Activity Fragments 

INTRODUCTION 

One-fourth of all dea*s in *e United Sta,« are due ,o n,aligna„, 
^mo. ,R.K. J3in. (1996, S^22l.,079-,080,. For diag„os«c and 

iu "netnods of tumor cell detection. 

Malignant cells can be distinfluisl,ed fton, „„^3| cella by different 
P~^a. One of tbosc i. t.e imnK,ru,l.a«on of malignant oel IZ 
ena«es uncontrolled ce,.^olifera«on. Nom«, diploid n,a„»nalian cells 
undergo a fi™,e „un*er of populadon doublings in culture, before they 
undergo senescence. '« is supposed that the number of population 
doubkngs in culture, before they undergo senescence. It is supposed 
that the number of p.pula«on doublings is related to the shortening of 

*~mosome ends, called telomers. i ry cell division, ■n.e reason for 

sa,d Shortening is based on the properties of the conventional 
;r^7-™'«V' ~" -chinery. DNA polymerases only wor. in 
o to3 direction and need an RNA primer. 

Immortalization Is thought to be associated with the expression of 
acbve telomerase. Said telomerase is a ribonucleoprotein catalyzing 
repet«ive .longabon of templates. This activity can be detected in a 
26 native protein extract of telomerase containing cells by a special PCR- 
system(N.W. KimasL(1994) S£ieaca2Sg..20„.20,5, known as 
telomeric repeat ampHfication protocol (TRAP,. The assay as used 
herem. is based on the t temerase sp cific extension of a ^bstrate 
Pnm r aS, and a subsequent ampir,ca«on of ti,e telomerase sp cific 
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extenslon products by a PGR step using a second primer (bioCX) 
complementary to the repeat structure. The characteristic ladder 
fragments of those assays are conventionally detected by the use of gel 
electrophoretic and labeling or staining systems. These methods can be 
5 replaced by MALDI-TOF mass spectrometry leading to faster accurate 
and automated detection. 
MATERIALS AND METHODS 
Preparation of cells 

1 X 10« cultured telomerase-positive cells were pelleted, washed 
10 once with PBS (137 mM NaCI. 2.7 mM KCI, 4.3 mM Na2HPoJ.7H,0, 
1.4 mM KH2PO4 in sterile DEPC water). The prepared cells may be 
stored at -750C. Tissue samples have to be homogenized, according to 
procedures well known in the art, before extraction. 
Telomerase extraction 
1 5 Pellet was resuspended in 200 //I CHAPS lysis buffer ( 1 0 mM Tris- 

HCI pH 7.5, 1 mM MgCI^, 1 mM EGTA, 0.1 mM benzamidine, 5 mM P- 
mercaptoethanol, 0.5% CHAPS, 10% glycerol) and incubated on ice for 
30 min. The sample was centrifuged at 12,000 g for 30 min at 4«>C. 
The supernatant was transfen^ed into a fresh tube and stored at 75«C 
20 until use. 

TRAP-assay 

2 //I of telomerase extract were added to a mixture of lOx TRAP 
buffer (200 mM Tris-HCI pH 8.3, 15 mM MgCI,, 630 mM KCI, 0.05% 
Tween 20, 10 mM EGTA) 50x dNTP-mix (2.5 mM each dATP. dTTP, 
25 dGTP, and dCTP), 10 pmol of TS primer and 50 pmol of bio CX primer in 
a final volume of 50 //I. The mixture was incubated at 30«C for 10 
minutes and 5 min. at 94<»C, 2 units of Taq Polymerase were added and 
a PGR was performed with 30 cycles of 94oc for 30 seconds, 50OC for 
30 seconds and 72 •C for 45 seconds. 
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Purification of TRAP-assay products 

For every TRAP-assay to be purified, 50 //I Streptavidin M-280 
Dynabeads (10 mg/ml) were washed twice with 1x BW buffer (5 mM 
Trfs-HCI, pH 7.5, 0.5 mM EDTA, 1 M NaCI). 50 >^ of 2x BW buffer were 
added to the PCR mix and the beads were resuspended in this mixture 
The beads were incubated under gentle shaking for 15 min. at ambient 
temperature. The supernatant was removed and the beads were washed 
twice with IX BW buffer. To the beads 50 ^ 25% ammonium hydroxide 
were added and Incubated at 6OOC for 10 mIn. The supernatant was 
saved, the procedure repeated, both supematants were pooled and 300 
tA ethanol (100%) were added. After 30 mIn. the DNA was pelleted at 
13,000 rpm for 12 min., the pellet was air-dried and resuspended in 600 
nl ultrapure water. 

MALDl-TOFMS of TRAP-assay products 
15 300 nl sample were mixed with 500 nl of saturated matrix-solution 

(3-HPA:ammonium citrate = 10:1 molar ratio in 50% aqueous 
acetonitrile), dried at ambient temperature and introduced into the mass 
spectrometer (Vision 2000, Rnlgan MAT). All spectra were collected in 
reflector mode using extemal calibration. 
20 Sequences and masses 

bioCX: d(bio-CCC TTA CCC TTA CCC TTA CCC TAA SEQ ID NO 45) 
mass: 7540 Da. 

TS: d(AAT CCG TGC AGC AGA GTT SEQ ID N0.46), mass: 5523 Da 
Telomeric-repeat structure: (TTAGGG)„, mass of one repeat: 1909.2 
25 Amplffication products: 

TS elongated by three telomeric repeats (first amplification product)- 
12452 Da. (N3) 

TS longated by four telomeric repeats: 14361 Da. (N^) 
TS longated by seven telomeric repeats: 20088 Da. (N,) 



wo 98/20166 



PCT/US97/20444 



-151- 



RESULTS 

Figure 62 depicts a section of a TRAP-assay MALDI-TOF mass 
spectrum. Assigned are the primers TS and bioCX at 5497 and 7537 
Da, respectively (calculated 5523 and 7540 Da). The signal marked by 
5 -an asterisk represents n-1 primer product of chemical DNA synthesis. 
The first telomerase specific TRAP-assay product is assigned at 1 2775 
"Da. This product represents a 40-mer containing three telomeric repeats. 
Due to primer sequences this is the first expected amplification product 
of a positive TRAP-assay. The product is elongated by an additional 
10 nucleotide due to extendase activity of Taq DNA polymerase (calculated 
non-extended product: 12452 Da, by A extended product: 12765 Da). 
The signal at 6389 Da represents the doubly charged ion of this product 
(calculated: 6387 Da). Rgure 63 shows a section of higher masses of 
the same spectrum as depicted in figure 62, therefore the signal at 
15 12775 Da is Identical to that in figure 62. The TRAP-assay product 

containing seven telomeric repeats, representing a 64-mer also elongated 
by an additional nucleotide. Is detected at 20322 Da (calculated: 20395 
Da). The signals marked 1, 2, 3 and 4 cannot be base-line resolved. 
This region includes of: 1. signal of dimeric n-1 primer, 2. second TRAP- 
20 assay amplification product, containing 4 telemeric repeats and therefore 
representing a 46-mer (calculated: 14341 Da/14674 Da for extendase 
longated product) and 3. dimeric primer-ion and furthermore all their 
corresponding depurination signals. There Is a gap observed between 
the signals of the second and fifth extension product. This signal gap 
25 corresponds to the reduced band Intensities observed in some cases for 
the third and fourth extension product In autoradiographic analysis of 
TRAP-assays (N.W. Kim etsL (1994) Science 266:2013). 

Th above-mentioned probi ms, caused by the dimeric primer and 
r lated signals, can be overcome using an ultrafiltration step employing a 
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molecular weight cut-off membrane for primer removal prior to MALDI 
TOF-MS analysis. This will permit an unambiguous assignment of the 
second ampfification product. 

EXAMPLE 14 

5 A method for Detecting Neuroblastoma -Specific Nested rt =^ ^ 
products Via NIALDI-TOF Mass SpectromettT RT-ampfified 
Introduction 

Neuroblastoma Is predominantly a tumor of early childhood with 
66 ^ of the cases presenting in children younger than s" years of age 
The most common symptoms are those due to tumor mass, bone pain or 
those caused by excessive catecholamine secretion. In rare cases ' 
neuroblastoma can be Identified prenatally (R.W. Jennings et al (I993, 
^L£^^^^ 28:1 ,68-1 174,. Approximately 70% of all patients with 
neuroblastoma have metastatic disease at diagnosis. The prognosis is 
dependent on age at diagnosis, clinical stage and other parameters 
For diagnostic purposes there is a high interest in reliable and 
.sensitive methods of tumor cell detection, ^ in control of autologous 
bone marrow transplants or on-going therapy. 

Since catecholamine synthesis is a characteristic property of 

TT^lTr "^'^ ™ ^^"^ '''' -^-^ <H. Naito et 

(1991) Pur. J. Cancer 22:762-765), neuroblastoma cells or 

metastasis In bone marrow can be identified by detection of human 
tyrosine 3-hydroxylase (E.G. 1.14.16.2, hTH) which catalyzes the first 
step in biosynthesis of catecholamines. 

The expression of hTH can be detected via reverse transcription 
IRT) polymerase chain reaction (PGR) and the amplified product can be 
analyzed via MALDI-TOF mass spectrometry. 
Mat rials and methods 

Celt- or tissue-treatment 
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Cultures cells were pelleted (10 min. 8000 rpm) and washed twice 
with PBS (137 mM NaCI, 2,7 mM KCI, 4.3 mM Na^HP04»7H^0, 1.4 mM 
KH2PO4 in sterile PEPC water). The pellet was resuspended in 1 ml 
lysis/binding buffer (100 mM Tris-HCI, pH 8.0, 500 mM UCI, 10 mM 
5 EDTA, 1 % U-dodecyl sulfate, 5 mM DTT) until the solution becomes 
viscose. Viscosity was reduced by DNA-shear step using a 1 ml syringe. 
The lysate may be stored in -75«C or processed further directly. Solid 
tissues (s^ patient samples) have to be homogenized before lysis. 
Preparation of rrtagnetic Oligo-dT(25) beads 
10 100 M- beads per 1x10« cells were separated from the storage 

buffer and washed twice with 200 //L lysis/binding buffer. 
Isdfation of poly A* RNA 

The cell lysate was added to the prepared beads and incubated for 
5 min. at ambient temperature. The beads were separated magnetically 
15 for 2-5 min. and washed twice with 0.5 ml LDS (10 mM Tris-HCI, pH 
.8.0, 0. 1 5 M Ua, 1 mM EDTA, 0. 1 % UDS). 

Solid-phase fJrst-strand cDNA synthesis 

The poly A*RNA containing beads were resuspended in 20 pL of 
reverse transcription mix (50 mM Tris-HCI. pH 8.3, 8 mM MgQ^, 30 mM 

20 Ka, 10 mM DTT, 1.7 mM dNTPs, 3 U AMV reverse transcriptase) and 
incubated for 1 hour at 45<»C (with a resuspension step all ten min.). 
The beads were separated from the reverse transcription mix, 
resuspended in 50 fA. of elution buffer (2 mM EDTA pH 8.0) and heated 
to 95«»C for 1 min. fur elution of the RNA. The beads with the cDNA 

25 first-strand can be stored in TB (0.089 M Tris-base, 0.089 M boric acid, 
0.2 mM EOTApH 8.0). TE 10 mM Tris-HCI, 0.1 mM EDTA, pH 8.0) or 
70% ethanol for further processing. 

Nested polymerase chain reaction 
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Beads containing cDNA first-strand were washed twice with 1 x 
PCR buffer (20 mM Tris-HCI pH 8.75, 10 mM KCI, 10 mM (NH,LSO 2 

MgSO., 0.,% Triton X-100, 0.1 bovine seru. albumin; and^' 
resuspended in PCR mix (containing 100) pmol of each outer primer 2 5 
B u Pfa (exo-, DNA poiymerase, 200 of each dNTP and PCR buffer in ! 
finaf volume of 50 pL). The mixture was incubated at 72«C 1 mi„ w 

PCR was added as template to a PCR mix d(as above but nested primers 
.nstead of outer primers, and subjected to the following temperature 
10 program: 940C 1 min. 650c 1 min. and 72oc 1 min. for 20 cycles 
Purification of nested amplified products 

10 JT" by-products are remove, using 

ia000 0acu,-o«u..rsfihratio„.uni.. Ultrafiltration was pertor„„<, a. 
7,500 g for 26 minutes. For every PCR to be purified so ,y< «. 
- M.SO Oyna^ads „0 .ng« _ ..3Hed Z:':L^:L^ZTT 
mM Tris-Ha, pH 7.5. 0.5 mM EDTA. 1 M NaCI,. added to «,e 
.u«.rafi«ra«on membrane and incubated u^Jer genUe shaking for 15 min 
a. amb,ent temperature. The supernatam was removed and «,e beads " 

«. were added to the beads and .™=„bated at ambient temperature L T 
™n. the supernatant was saved, the procedure repeated, both 
supematants were pooled and 300 pL ethanol (100%) were added 
After 30 min. the DNA was pelleted at 13.000 rpm for 12 min.. the 
pe«et was airnlried and resuspended in 600 nl ultrapure water 

'^•ALDI-rOF MS of msted emplima products 

300 nl sample was mixed with 500 nl of saturat»H ™.. ■ 
„„ saturated matrix-solution 

(3-HPA: ammonium citrate = 10:1 molar ratio in 60% aqueous 
acetonitrile,. dried at ambient temperature and introduced into the mass 
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spectrometer (Vision 2000, Rnigan MAT). All spectra were collected in 
reflector mode using external calibration. 
Outer primers: 

hTHl: d(TGT CAG AGO TGG ACA AGT GT SEQ ID NO:47} 

5 hTH2: d(GAT ATT GTC TTC CCG GTA GC SEQ ID NO;48) 

Nested primers: 

bio-hTH d(bio-CTC GGA CCA GGT GTA CCG CC SEQ ID NO:49), 
mass: 6485 Da 

hTH6; d(CCT GTA CTG GAA GGC GAT CTC SEQ ID NO:50), 
10 mass: 6422 21 Da 

mass of biotinylated single strand amplified product: 19253:6 Da 
mass of nonbiotinylated single strand amplified product: 1 8758.2 
Da 
Results 

15 A MALDI-TOF mass spectrum of a human tyrosine 3-hydroxylase 

(hTH) specific nested amplified product (61-mer) is depicted in figure 64. 
The signal at 18763 Da con-esponds to non-biotinylated strand of the 
amplified product (calculated: 1 8758.2 Da, mass error: 0.02 Da). The 
ignals below 10,000 and above 35,000 Da are due to multiply charged 
20 and dimeric amplified product-ions, respectively. 

The product was obtained from a solid phase cDNA derived in a 
reverse transcription reaction from 1x1 0« cells of a neuroblastoma cell- 
line (L-A-N-1) as described above. The cDNA first-strand was subjected 
to a first PGR using outer primers (hTHl and hTH2), an aliquot of this 
PCR was used as template in a second PCR using nested primers (biohTH 
and hTH6). The nested amplified product was purified and MALDI-TOF 
MS analyzed: 



25 
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The spectrum in Fig. 64 demonstrates the possibility of 
neuroblastoma cell detection using nested RT-PCR and MALDI-TOF MS 
analysis. 

EXAMPLE 15 

Material and Methods 
Probe 

The identity of codon 634 in each of the three alleles was 
confirmed by Rsal enzymatic digestlon,.single strand conformational 
polymorphism or Sanger sequencing. Exon 1 1 of the RET gene was PGR 
amplified (40 cycles) from genomic DNA using Taq-Polymerase 
(Boehringer-Mannheim) with 8 pmol each of 5'-biotinylated forward (5'- 
biotin-CAT GAG GCA GAG CAT ACG CA-3' SEQ ID NO:51) and 
15 unmodified reverse (S'-GAC AGO AGC ACC GAG ACG AT-3' SEQ ID 
NO:52) primer per tube; amplified products were purified using the 
Qiagen (QIAquick" kit to remove unincorporated primers. 1 5 ^1 of 
amplified product were immobilized on 10//L (10 mg/mL) Dynal 
streptavidin coated magnetic beads, denatured using the manufacturer's 
20 protocol, and the supernatant containing antisense strand discarded, the 
PROBE reaction was perfomied using thermoSequenase (TS) DNA 
Polymerase (Amersham) and Phannacia dNTP/ddNTPs. 8 pmol of 
extension primer (5'-CGG CTG CGA TCA CCG TGC GG-3' SEQ ID 
NO:53) was added to 13 ^ H,0. TS-buffer, 2mM ddATP (or 
25 ddTTP), and 2fiL of 0.5 mM dGTP/dCTP/dTTP (or dGTP/DCTP/dATP), 
and the mixture heated for 30 sec @ 940C, followed by 30 cycles of 10 
sec @ 940C and 45 sec @ 50oC; after a 5 min. incubation @ 95°C, the 
sup matant was decanted, and products were desalted by ethanol 
pr cipitation with the addition of 0.5 //L of lOmg/mL glycogen. The 
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resulting pellet was washed in 70% ethanol, air dried, and suspended in 
1 //L H^d. 300 nL of this was mixed with the MALOt matrix (0.7 M 3- 
hydroxypicolinic acid, 0.07 M ammonium citrate in 1:1 HjOiCHaCN) on a 
stainless steel sample probe and air dried. Mass, spectra were collected 
5 on a Thermo Bionalysis Vision 2000 MALDI-TOF operated in reflectron 
mode with 5 and 20 kV on the target and conversion dynode, 
respectively. Experimental masses {m,(exp)) reported are those of the 
neutral molecules as measured using external calibration. 
Direct Measurement of Diagnostic Products 
10 PGR amplifications conditions for a 44 bp region containing codon 

634 were the sarnie as above but using Pfu polymerase; the forward 
primer contained a ribonucleotide at its 3'-terminus (forward, 5'-GAT 
CCA CTG TGC GAC GAG C (SEQ ID N0:54) -ribo; reverse, S'-GCG GCT 
GCG ATC ACC GTG C (SEQ ID NO:55). After product immobilization 
15 and washing, 80 fA. of 12.5% NH^OH was added and heated at SO^C 
overnight to cleave te primer from 44-mer (sense strand) to give a 25- 
mer. Supernatant was pipetted off while still hot, dried resuspended in 
: 50 fA. HjO, precipitated, resuspended, and measured by MALDI-TOF as 
above. MALDI-FTMS spectra of 25-mer synthetic analogs were collected 
20 as previously described (U, Y, si aL, (1 996) Anal. Chem. 68:2090- 
2096); briefly, 1-10 pmol DNA was mixed 1:1 with matrix on a direct 
insertion probe, admitted into the external ion source (positive ion mode), 
ionized upon irradiance with a 337 nm wavelength laser pulse, and 
transferred via rf-only quadruple rods into a 6.5 Tesia magnetic field 
25 where they were trapped collisionally. After a 15 second delay, ions 
were excited by a broadband chirp pulse and detected using 256K data 
points, resulting in time domain signals of 5 s duration. R port d 
(neutral) masses are those of the most abundant isotope peak after 
ubtracting the mass of the charge carrying proton (1.01 Da). 
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Results 



The first scheme presented utilizes the PROBE reaction shown 
schematically in Rgure 65. A 20-mer primer is designed to bind 
specifically to a region on the complementary template downstream of 
5 the mutation site; upon annealing to the template, which is labelled with 
biotin and immobilized to streptavidin coated magnetic beads, the PROBE 
primer is presented with a mixture of the three deoxynucleotide 
triphosphates (dNTPs), a di-dNTP (ddNTP). and a DNA polymerase 
(Figure 65). The primer is extended by a series of bases specific to the 
lO Identity of the variable base in codon 634; for any reaction mixture (e^ 

ddA+dT+dC+dG), three possible extension products representing the ' 
three alleles are possible (Rgure 65). 

For the negative control (Rgure 66), the PROBE reaction with 
ddATP+dNTPs (N=T, C, G) causes a M,(exp) shift of the primer from 
15 6135 to 6726 Da (Am + 591). The absence of a peak at 6432 rules out 
a C-.A mutation (Rgure 65); the mass of the single observed peak is 
^ more consistent with extension by C-ddA (M,(calc) 6721, +0.07o^ error) 
, than by T-ddA (Mr(calc) 6736, -0. 1 5% error) than of A3TC,G expected 
for C--A mutant. Combining the ddA and ddT reaction data, it is clear 
20 that the negative control is as expected homozygous normal at codon 
634. 



The ddA reaction for patient 1 also results in a single peak 
(M,(exp)=6731) between expected values for wildtype and C-*T 
mutation (Rgure 65b). The ddT reaction, however, results in two cleariy 
resolved peaks consistent with a heterozygote wildtype (H(exp) 8249, 
+ 0.04% mass error)/C-*T mutant (H(exp) 6428 Da, +0,08% mass 
error). For patient 2, the pair of Rgure 66c ddA products r present a 
h terozygote C-.A (M,(exp) 6431, -0.06% mass rror)/nom,al (M,(exp) 
6719, ^.03% mass rror) allele. The ddT reaction confirms this, with a 
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single peak measured at 8264 Da consistent with unresolved wlldtype 
and C-*A alleles. The value of duplicate experiments is seen by 
comparing Figures 66a and 66b; while for patient 1 the peak at 6726 
from the ddA reaction represents only one species, similar peak from 
patient 1 is actually a pair of unresolved peaks differing in mass by 15 



Da. 



An alternate scheme for point mutation detection is differentiation 
of alleles by direct measurement of diagnostic product -massed. A 44- 
mer containing the RET634 site was generated by the PCR, and the 19- 
1 0 mer sense primer removed by NH4OH cleavage at a ribonucleotide at its 
3' terminus. 

Rgure 67 shows a series of MALDI-FTiy/IS spectra of synthetic 
analogs of short amplified products containing the RET634 mutant site. 
Figures 67a-c and 67d-f are homozygous and heterozygous genotypes, 
15 respectively. An internal calibration was done using the most abundant 
isotope peak for tiie wildtype allele; application of this (external) 
calibration to tiie five otiier spectra resulted in better tiian 20 ppm mass 
accuracy for each. Differentiation by mass alone of the alleles is 
sti-aightforward. even for heterozygote mixtures whose components 
20 differ by 16.00 (Rgure 67d), 2501 (Rgure 67e), or 9.01 Da (Rgure 6Sf). 
The value of high performance MS is clear when recognition of small 
DNA mass shifts is the basis for diagnosis of tiie presence or absence of 
a mutation. The recent reintroduction of delayed extraction (DE) 
techniques has improved the performance of MALDI-TOF with shorts 
25 DNAs (Roskey, M.T. et aL, (1996) AriaLChem. 68:941-946); a resolving 
power (RP) of > 10^ has been reported for a mixed-base 50-mer, and a 
pair of 31-mere witii a C or a T (Am 1 5 Da) at a variable position 
r s Iv d nearly to baseline. Thus DE-TOF-MS has demonsti:at d the RP 
requir d for separation of tiie individual components of heterozygotes. 
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Even with DE, however, the precision of DNA mass measurement with 
TOF is typically 0. 1 % (8 Da at 8 kDa) using external calibration 
sufficiently high to result in incorrect diagnoses. Despite the possibility 
-of space charge induced frequency shifts (Marshall, A.G. eta/ 
5 AoaLObsm. ^:21 5A-229A), MALDI-FTMS mass errors are rarely as 
high as 0.005% (0.4 Da at 8 KDa), making internal calibration 
unnecessary/ 

The methods for DNA point mutation presented bare are not only 
applicable to the analysis of single base mutations, but also to less 
10 demanding detection of single or multiple base insertions or deletions 

and quantification of tandem two, three, or four base repeats. The 
PROBE reaction yields products amenable to analysis by relatively low 
perfom^ance ESI or MALDI instrumentation; direct measurement of short 
amplified product masses is an even more direct means of mutation 
15 detection,and will likely become more widespread with the increasing 
interest in high performance MS available with FTMS. 

EXAMPLE 16 

ImmobHization of nucleic acids on solid supports via an a«rfJ«Kji^ 
covaient bifunctional trityl finker acid-labile 

20 Aminolinked DNA was prepared and purified according to standard 

methods. A portion (lOeq) was evaporated to dryness on a speedvac 
and suspended in anhydrous DMF/pyridine (9:1; o.l ml). To this was 
added the chlorotrityl chloride resin (1 eq, 1.05^ol/mg loading) and the 
mixture was shaken for 24 hours. The loading was checked by taking a 
sample of the resin, detritylating this using B0% AcOH, and measuring 
the absorbance at 260nm. Loading was ca. 1 50pmol/mg resin. 

In 80% acetic acid, the half-life of cleavage was found to be 
substantially less than 5 minutes-this compares witii trityl ether-based 
approaches of half-lives of 105 and 39 minutes ior para and meta 
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substituted bifunctlonaf dimethoxytrityl linkers respectively. Preliminary 
results have also indicated that the hydroxy picolinic acid matrix alone is 
sufficient to cleave the DNA from the chlorotrityl resin. 

EXAMPLE 17 

5 Immobnization of nucleic acids on sofid supports via hydrophobic tritvl 
linker ^ 

The primer contained a 5 '-dimethoxytrityl group attached using 
routine trityl-on DNA synthesis. 

CIS beads from an oligo purification cartridge (0.2 mg) placed in a 
10 filter tip was washed with acetonitrile, then the solution of DNA (50 ng 
in 25 /A) was flushed through. This was then washed with 5% 
acetonitrile in ammonium citrate buffer (70 mM, 250 jvl). To remove the 
DNA form the CIS, the beads were washed with 40% acetonitrile in 
water (10//I) and concentrated to ca 2/d on the Speedvac. The sample 
15 was then submitted to MALDI. 

The results showed that acetonitrile/water at levels of ca.>30% 
are enough to dissociate the hydrophobic interaction. Since the matrix 
used in MALDI contains 50% acetonitrile, the DNA can be released from 
the support and successfully detected using MALDI-TOF MS (with the 
20 trityl group removed during the MALDI process). 

Rgure 69 is a schematic representation of nucleic acid 
immobilization via hydrophobic trityl linkers. 

EXAMPLE 18 

Immobilization of nudeic acids on soHd supports via Streptavidin- 
Zo Iminobiotin 

Experimental Procedure 

2-iminobiotin N-hydroxy-succinimid ester (Sigma) was conjugated 
t the oligonucleotides with a 3'- or 5-'amino linker following the 
conditions suggested by the manufacturer. The completion of the 
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reaction was confirmed by MALDI-TOF MS analysis and th. . 
purified by reverse phase HPLC. ^ 
For each reaction, 0. 1 mg of streptavidin-co«tow 

s :r r r 

Pmo, Of *e corresponding oBgo in *e presence o, ,M u.a 

"-ads bound w,* oligonucleotides v.ere washed nWce ^ „ ' 
ammonium carbonate (PH9.S). Then the beeds ^ 
3-HPA matrix e. room temperature ,or . J l^^Z^I: T 
10 was applied to MALOI-rOK .or b.«n ;s; l^^f 

experiment, 1.6. mol of free biosn (80-fold excess to * T 
' ^ Of .0 mM ammonium citrate was addedm::T 
n*.. incuba«on at room ,empe,«ure. , ^ of 3.H^ ' " 

and 0.S ^ Of supen^tant was appiied to mI^"™. ^ ^ 
IS the recovery of the bound iminobiotin oligo the blL T 

-=«nen. were again incubated with a Z'o^^^lT ""^ 
supernatant was appiied to MALDI-TOF MS Z „ ° ' 

. reieased i^r."::: ^L^ 

^ -'^-P'av.dm beads as Shown in Figures 70 and 71. 

EXAMPLE 19 

M"^.™ Analysis Using Loop P,*„er o«go Base Extehsion 
MATERIALS AND METHODS 

Ce«omfcO/«4. Genomic DNA was obtained from heahhv 
■nd,v,duals and patients suffering from sickle cell » 
« and mutated se,„ences have been evaluate c ^ """"^^ 

Sanger secuencing. oonvenuonally by standard 

w« stabl^hed a«, op«mi.ed to use the reacon produ tti^r; 
<^ r pur^cabon step for capturing with streptaVdin coated ,1 The 
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target amplffication for LOOP-PROBE reactions were performed with the 
loop-cod5 d(GAG TCA GGT GCG CCA TGC CTC AAA CAG ACA CCA 
TGG CGC, SEQ ID No. 58) as forward primer and B-1 1-bio d(TCT CTG 
TCT CCA CAT GCC CAG, SEQ ID. No. 59) as biotinylated reverse 

5 primer. The underiined nucleotide In the loop-cod 5 primer is mutated to 
introduce an invariant Cfol restriction site Into the amplicon and the 
nucleotides in Italics are complementary to a part of the amplified 
product. The total PGR volume was 50//! including 200-ng genomic 
DNA, lU Taq-polymerase (Boehringer-Mannheim, Cat# 1596594), 1.5 
10 mM MgQj, 0.2 mM dNTPs (Boehringer-Mannheim, Ca# 1277049), and 
10 pmol of each primer. A specific fragment of the G-globin gene was 
amplified using the following cycling condition: 5 min 94«C followed by 
40 cycles of : 30 sec @ 94«»C, 30 sec @ 56«»C, 30 sec @ 72«'C, and a 
final extension of 2 min at 72*»C. 
1 S Capturing and denaturation of biotinylated templates. 10//I 

.paramagnetic beads coated with streptavidin (lOmg/ml; Dynal, 
Dynabeads M-280 streptavidin Cat# 1 1 2.06) and treated with 5x binding 
solution (5 M NH4CI, 0.3M NH4OH) were added to 40 p\ PCR volume 
(10//1 of the amplified product was saved for check electrophoresis). 
20 After Incubation for 30 min at 37«»C the supernatant was discarded. The 
captured templates were denatured with 50 //I 100 mM NaOH for 5 min 
at ambient temperature, then washed once with 50 tA 50 mM NH OH 
and three times with 100 lA lOmM Tris.CI, pH 8.0. The single strlnded 
DNA served as templates for PROBE reactions. 
25 Primer oligo base extension (PROBE) reaction. The PROBE 

reactions were performed using Sequenase 2.0 (USB Cat# E707752 
including buffer) as enzyme and dNTPs and ddNTPs supplied by 
Boehringer-Mannheim (Cat# 1277049 and 1008382). The ratio between 
dNTPs (dCTP. dGTP. dTTP) and ddATP was 1:1 and the total used 
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concentration was 50 //M of each nucleotide. After addition of S/A 1-fold 
Sequenase-buffer the beads were Incubated for 5 min at 65«>C and for 
10 min at 37oc. During this time the partially self complementary primer 
annealed with the target site. The enzymatic reaction started after 
5 addition of 0.5 //I 1 00 mM dithiothreltol (DTT), 3.5 pi dNTP/ddNTP 
solution, and 0.5 /A Sequenase (0.8 U) and incubated at 37oc for 10 
min. Hereafter, the beads were washed once in 1-fold TE buffer (10 mM 
Tris, ImM EDTA, pH 8.0). 

Cfol restriction digest. The restriction enzyme digest was 
10 performed in a total volume of BjA using 10 U Cfol in 1-fold buffer L 

purchased from Boehringer-Mannheim. The incubation time was 20 min 
at 37«»C. 

anatysh"'"^ °^ c^^ai^/io^f/c products for mass spectrometric 

15 After tiie restriction digest, the supernatant was precipitated in 45 

//I H,0, 10//I 3M NH,- acetate (pH 6.5), 0.5 //I glycogen (10 mg/ml in 
water, Sigma, Cat# G1765), and 1 1 0 ytd absolute ethanol for 1 hour at 
room temperature. After centrifugation at 13,000 g for 10 min tiie pellet 
was washed in 70% etiianol and resuspended in 2//M8 Mohm/cm H,0 

20 The beads were washed in 100 //I 0.7 M NH^ citrate followed by 100 //I 
0.05 M NH,. citrate. TTie diagnostic products were obtained by heating 
the beads in 2 >wl 50 mM NH4OH at 80*C for 2 min. 

Sample preparation and analysis on MALDI-TOF mass 
spectrometry. 

25 Same preparation was performed by mixing 0.6 p\ of mati^ix 

solution (0.7 M 3-hydroxypicolinic acid, 0.07 M dibasic ammonium 
citrate in 1 : 1 H^O.CHaCN) witii 0.3 ii/l of either resuspended 
DNA/glycogen pellet or supernatant after heating tiie beads in 50 mM 
NH^OH n a sample target and allowed to air dry. The sample target 
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was automatically introduced in to the source region of an unmodified 
Perspective Voyager MALDI-TOF operated in delayed extraction linear - 
mode with 5 and 20 kV on the target and conversion dynode, 
respectively. Theoretical molecular mass (M,(calc)) were calculated from 
5 atomic compositions; reported experimental (M,(exp)) values are those of 
the singly-protonated form, 
RESULTS 

The LOOP-PROBE has been applied to the detection of the most 
common mutation of codon 6 of the human B-globin gene leading to 
10 sickle cell anemia. The single steps of the method are schematically 
presented in figure 72. For the analysis of codon 6, a part of the B- 
globin gene was amplified by PGR using the biotinylated reverse primer 
Bl Ibio and the primer loop-cod5 which is modified to introduce a Cfol 
recognition site (fig. 72a). The amplified product is 192 bp in length. 
15 After PGR the amplification product was bound to streptavidin coated 
paramagnetic particles as described above. The antisense strand was 
isolated by denaturation of the double stranded amplified product (Fig. 
72b). The intra-molecule annealing of the complementary 3' end was 
accomplished by a short heat denaturation step and incubation at 37 ®C. 
20 The 3' end of the antisense strand is now partially double stranded (Hg. 
72c). For analyzing the DNA downstream of the self annealed 3'-end of 
the antisense strand, the primer oligo base extension (PROBE) has been 
performed using ddATP, dCTP, dGTP, dTTP (Rg. 72d). This generates 
different products in length specific for the genotype of the analyzed 
25 individual. Before the determination of the length of these diagnostic 
products, the DNA was incubated with the Gfol restriction endonuci ase 
that cuts 5' of the xt nded product. This st p frees the stem lo p from 
the template DNA wher as the extended product still keeps attached to 
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the template. The extended products are then denatured by heating from 
the template stand and analyzed by MALDI-TOF mass spectrometry. 

Since the MALDI-TOF analyses were performed with a non- 
calibrated instrument, the mass deviation between observed and 
5 expected values was approximately 0.6% higher than theoretically 
calculated. Nevertheless, the results obtained were conclusive and 
reproducible within repeated experiments. In all analyzed supematants 
after the restriction digest the stem loop could be detected. Independent 
of the genotype, the stem loop has had in all analyses molecular masses 
10 about 8150 Da (expected 81 1 1 Da). An example is shown in Figure 

73a. The second peak in this figure with a mass of 4076 Da is a doubly 
charged ion of the stem loop. Figure 73b to 73d show the analyses of 
different genotypes as indicated in the respective inserts. HbA is the 
wildtype genotype and HbC and HbS are two different mutations in 
15 codon 6 of the (^globin gene which cause sickle cell disease. In the 
wildtype situation a single peak with a molecular mass of 4247 Da and 
another with 6696 Da are detected (Rg. 73b). The latter corresponds to 
the biotinylated PGR primer (S-1 1-bio) unused in the PGR reaction which 
also has been removed in some experiments. The former corresponds to 
20 the diagnostic product for HbA. The analyses of the two individual DNA 
molecules with HbS trait as well as compound heterozygosity (HbS/HbC) 
for the sickle cell disorder lead also to unambiguous expected results 
(Fig. 73c and 73d). 

In conclusion, the LOOP-PROBE is a powerful means for detection 
25 of mutations especially predominant disease causing mutations or 

common polymorphisms. The technique eliminates one specific reagent 
for mutation detection and, therefore, simplifies the process and makes it 
more amenable to automation. The specific extended product that is 
analyzed is cleaved off from the primer and is therefore shorter compared 
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to the conventional method- In addition, the annealing efficiency is 
higher compared to annealing of an added primer and should therefore 
generate more product. The process is compatible with multiplexing and 
various detection schemes (e^ single base extension, oligo base 
5 extension and sequencing). For example, the extension of the loop- 
primer can be used for generation of short diagnostic sequencing ladders 
within highly polymorphic regions to perform, for example, HLA typing or 
resistance as well as species typing (e^, Mycobacterium tuberculosis)). 

EXAMPLE 20 

MaSInvTs'^*"^** Dependent Amplification of CKR-B and Detection by 
MATERIALS AND METHODS 

Genomic DNA. Human genomic DNA was obtained from 
healthy individuals. 

PCR-Amplification and Purification. PGR amplification of a part of 
the CKR-5 gene was accomplished using ckrT7f as sense primer diACC 
TAG CGT TCA GTT CGA CTG AGA TAA TAG GAG TCA GTA TAG GAG 
CTC TGA TTT TGC ATA C {SEQ ID NO. 60). The underlined sequence 
corresponds to the sequence homologous to GKR-5, the bolded sequence 
corresponds to the T7-RNA polymerase promoter sequence and the italic 
sequence was chosen randomly. ckrSr was used as antisense primer 
d(AAC TAA GGG ATG TGG AGA AGA (SEQ ID NO. 61). Purification of 
the amplified product and removal of unincorporated nucleotides was 
carried out using the QIAquick purification kit (Qiagen, cat# 28104). In 
25 the final PGR volume of 50 /A were 200 ng genomic DNA, 1 U Taq- 

polymerase (Boehringer-Mannheim, catjff 1596594), 1.5 mM MgCIa 0.2 
mM dNTPs (Boehringer-Mannheim, cat# 1277049), and 10 pmol of ach 
primer. The specific fragment of the CKR-5 gene was amplified using the 
following cycling conditions: 5 min @ 94«»C followed by 40 cycles of 45 



15 



20 
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see @ 940c, 45 sec 520C, 5 sec @ 720C, and a final extension of 5 
min at 72.°C. 

T7-RNA Polymerase conditions. One third of the purified DNA 
(about 60ng) was used In the T7-RNA polymerase reaction. (Boehringer 
5 Mannheim, cat# 881 767). The reaction was carried out for 2h at 37oc 
according to the manufacturer's conditions using the Included buffer. 
The final reaction volume was 20 0.7 //I RNasin (33 U/M) had been 
added. After the extension reaction, the enzyme was inactivated by 
Incubation for 5 min at 65 "C. 

10 DNA digestion and conditioning of the diagnostic products for 

mass spec analysis. 

The template DNA was digested by adding RNase-free DNase I 

(Boehringer-Mannheimn, cat# 776 758) to the inactivated T7 mixture and 

incubation for 20 min at room temperature. Precipitation was carried out 

15 by adding 1 fA glycogen (10 mg/ml, Sigma, cat# G1765), 1/10 volume 

3M NH,.acetate (pH 6.5), and 3 volume absolute ethanol and Incubation 

for 1 hour at room temperature. After centrifugation at 13,000 g for 10 

min, the pellet was washed In 70o^ ethanol and resuspended In 3 //I 18 

Mohm/cm H^O. 1 fA was analyzed on an agarose gel. 

20 Sample preparation and analysis on MALDI-TOF mass 

spectrometry 

Sample preparation was performed by mixing 0.6 ,A of matrix 
solution (0.7 M 3-hydroxypicolinic acid, 0.07 M dibasic ammonium 
citrate In 1 : 1 H,0:CH3CN) with 0.3 ;^ of resuspended DNA/glycogen on a 
25 sample target and allowed to air dry," The sample target was introduced 
into the source region of an unmodified Rnnlgan VISION2000 MALDI- 
TOF operated in relectron mode with 5kV. The theoretical molecular 
mass was calculated form atomic composition; reported experimental 
valu s are those of singly-pronated form. 
30 RESULTS 
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The chemokine receptor CKR-5 has been identified as a major 
coreceptor in HIV-1 (see e.g., WO 96/39437 to Human Genome 
Sciences; Cohen, J. et a/. Science 275:1261). A mutant allele that is 
characterized by a 32 bp deletion is found in 16% of the HIV-1 
5 seronegative population whereas the frequency of this allele is 35% 

lower in the HIV-1 seropositive population. It is assumed that individuals 
homozygous for this allele are resistant to HIV-1. The T7-RNA 
polymerase dependent amplification was applied to identify this specific 
region of the chemokine receptor CKR-5 (Rgure 74). Human genomic 
10 DNA was amplified using conventional PCR. The sense primer has been 
modified so that it contains a random sequence of 24 bases that 
facilitate polymerase binding and the T7-RNA polymerase promoter 
sequence (Figure 75). The putative start of transcription is at the first 
base 5' of the promoter sequence. ckr5r was used as an antisense 
1 5 primer. PCR conditions are outiined above. The amplified product 

derived from wildtype alleles is 75 bp in length. Primer and nucleotides 
were separated from the amplification product using the Qiagen QIAquick 
purification kit. One third of the purified product was applied to in vitro 
ti-anscription with T7-RNA polymerase. To circumvent interference of the 
20 template DNA, it was digested by adding RNase-free ONase I. RNA was 
precipitated and this step also leaves the degraded DNA in the 
supernatant. Part of the redissolved RNA was analyzed on an agarose 
gel and the rest of the sample was prepared for MALDI-TOF analysis. 
The expected calculated mass of the product is 24560 Da. A dominant 
25 peak, that corresponds to an approximate mass of 25378.5 Da can be 
observed. Since the peak is very broad, an accurate determination of 
molecular mass was not possible. The peak does not correspond to 
r sidual DNA template. Rrst, the template DNA is digested, and second. 
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the DNA strands would have a mass of 23036.0 and 23174 Da 
respectively. 

This example shows that T7 RNA polymerase can effectively 
amplify target DNA. The generated RNA can be detected by Mass 
5 spectrometry. In conjunction with modified (e.g.. 3'- 

deoxy)ribonudeotides that are specifically incorporated by a RNA 
polymerase but not extended any further, this method can be applied to 
determine the sequence of a template DNA. 

EXAMPLE 21 

10 MALDI Mass Spectrometry of RNA Endonuclease Digests 
MATERIALS 

Synthetic RNA (Sample A:5'- 
UCCGGUCUGAUGAGUCCGUGAGGAC-3' (SEQ ID 62); sample B:5'- 
GUCACUACAGGUGAGCUCCA-3' (SEQ ID NO 63); sample C:5'- 
15 CCAUGCGAGAGUAAGUAGUA-3' (SEQ ID NO. 64)) samples were 
obtained from DNA technology (Aahus, Denmark) and purified on a 
denaturing polyacrylamide gel (Shaler, T. A. et al. (1996) Anal. Ch^m 
M:5766-579). Rnases T, (Eurogentec), U, (Calbiochem), A (Boehringer- 
Mannheim) and PhyM (Pharmacia) were used without additional 
20 purification. Streptavldin-coated magnetic beads (Dynabeads M-280 

Streptavidin, Dynal) were supplied as a suspension of 6-7x10" bead/ml 
(10 mg/ml) dissolved in phosphate-buffered saline (PBS) containing O 1 % 
BSA and 0.02% NaN^. 3-Hydroxypicolinic acid (3-HPA) (Aldrich) was 
purified by a separate desalting step before use as described in more 
25 detail elsewhere (Uttle, D. P. et al. (1995) Proc. Natl. A^^w . i. <;A 
S2, 2318-2322). 
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METHODS 

In vHro transcription reaction. The S'-biotinylated 49 nt in vitro 
transcript (SEQ ID No. 65): 

AGGCCOGCGGCAAGACGGAAAGACCAUGGUCCCUNAUCUGCCGCAGGAUC 
5 was produced by transcription of the plasmid pUTMS2 (linearized with 
the restriction enzyme BamHI) with T7 RNA polymerase (Promega). For 
the transcription reaction 3 //g template DNA and 50u T7 RNA 
polymerase were used in a SO /A volume of 1 u///l RNA guard (Rnax 
inhibitor, Pharmacia), 0.5 mM NTP's 1.0 mM 5'-biotin-ApG dinucleotide, 
10 40 mM Tris-HCI (pH 8,0), 6 mM MgCI^ 2 mM spermidine and 1 0 mM 

DTT. Incubation was perfomied at 37«C for 1 hour, then another aliquot 
of 50 units T7 RNA polymerase was added and incubation was continued 
for another hour. The mixture was adjusted to 2M NH4. acetate and the 
RNA was precipitated by addition of one volume of ethanol and one 
15 volume of isopropanol. The precipitated RNA was collected by 

centrifugation at 20,000 X g for 90 min at 4»C, the pellet was washed 
with 70% ethanol, dried and redissoived at 8 M urea. Further 
purification was achieved by electrophoresis through a denaturing 
polyacrylamide gel as described elsewhere (Shaler, T. A. et al. (1996) 
20 Anal. Chem. 68:576-579). The ration of 5'-biotlnylated to non- 
biotinyiated transcripts was about 3:1 . 

Ribonuclease assay. For partial digestion with selected RNases 
different enzyme concentrations ad assay conditions were employed as 
summarized in table VII. The solvents for each enzyme were selected 
25 following the suppliers' instructions. The concentrations of the syntiietic 
RNA samples and the in vitro transcript were adjusted to 5-10 x lO^^M. 
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The reaction was stopped at selected times by mixing 0.6 //I 
aliquots of the assay with 1 .5 //I of 3 HPA-solution. The solvent was 
subsequently evaporated in a stream of cold air for the MALDI-MS 
analysis. 

5 Umited alkaline hydrolysis was performed by mixing equal volumes 

(2.0^) of 25% ammonium hydroxide and RNA sample (5-10 x 10^ M) at 
60«»C. 1 /A aliquots were taken out at selected times and dried in a 
stream of cold air. For these samples it turned out to be important to 
first dry the digests in a stream of cold air, before 1.5pi of the matrix 
10 solution and 0.7 fA of NH^+ loaded cation exchanged polymer beads 
were added. 

The reaction was stopped at selected times by mixing 0.6 //I 
aliquots of the assay with 1 .5 of SHPA-solution. The solvent was 
subsequently evaporated in a stream of cold air for the MALDI-MS 
15 analysis. 

Umited alkaline hydrolysis was performed by mixing equal volumes 
(2.0 /A) of 25% ammonium hydroxide and RNA sample (5-10 x 10 « M) at 
60»C. 1 ^ aliquots were taken out at selected times and dried in a 
stream of cold air. For these samples it turned out to be important to 
20 first dry the digests In a stream of cold air, before 1 .5 >wl of the matrix 
solution and 0.7 //I if a suspension of NH^* loaded cation exchange 
polymer beads were added. 

Separation of 5'-b!otlnylated fragments. Steptavid in-coated 
magnetic beads were utilized to separate 5'-biotinylated fragments of the 
25 in vitro transcript after partial RNase degradation. The biotin moiety in 
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this sample was introduced during the transcription reaction initiated by 
the S'-biotin-pApG-dinucleotide. Prior to use, the beads were washed 
twice with 2 X binding & washing (b&w) buffer (20 mM Tris-HCI 2 mM 
EDTA, 2 M NaCI pH 8.2) and resuspended at 10 mg/ml in 2 x b&w 
5 buffer. Circa 25 pmoi of the RNA /n vitro transcript were digested by 
RNase U2 using the protocol described above. The digestion was 
stopped by adding 3 //I of 95% formamide containing 10 mM trans- 1 2 
diaminocyclohexane-N.N,N',N'-tetraacetic acid (COTA) at QO-C for 5 
min, followed by cooling on ice. Subsequently, capture of the 
10 biotinylated fragments was achieved by incubation of 6//I of the digest 
with 6 //I of the bead suspension and 3/W of b&w buffer at room 
temperature for 1 5 min. Given the binding capacity of the beads of 200 
pmol of biotinylated oligonucleotide per mg of beads, as specified by the 
manufacturer, ti.e almost 2-times excess of oligonucleotide was used to 
15 assure a full loading of tiie beads. The supernatant was removed, and 
the beads were washed twice with 6//I of H,0. The CDTA and 950/, 
Tomiamide at 90oc for 5 min. After evaporation of tiie solvent and tiie 
formam.de the ^2.5 pmol of fragments were resuspended in 2 //I H,0 
and analyzed by MALOI-MS as described above. 

Sample preparation for I^ALDI-IVIS. 3-Hydroxypicolinic acid (3- 
HPAI was dissolved in ultra pure water to a concenti-ation of ca. 300 
mM. Metal cations were exchanged against NH,* as described in detail 
previously. (Littie, D. P. et al. (1995)Proc. fsl^tl o^; 
2318-2322). Aliquots of 0.6//I of the analyte solution were mixed witii 
1 .5 //I 3-HPA on a flat inert metal substrate. Remaining alkali cations 
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preaent in «,e sample solution as well as on «,e substrate surface were 
removed by the addition of 0.7 pi of the solution of NH,* - loaded'oation 
exohange polymer beads. During solvent evaporation, the beads 
aocumulated in the center of the preparation, were not used for the 
5 analysis, and were easily removed with a pipette tip. 

Insuument. A prototype of the Vision 2000 (ThermBioanalysis 
Hemel, Hempstead, UK. reflectron time of flight mass spectrometer was 
"«d for «,e mass spectrometry. Ions were generated by irradiation with 
a frequency-tripled ND:VAe laser (35B nm. 6 ns; SpeWrum GmbH 
Be,*n 6em,any, and accelerated to 10 ke V. Delayed ion extraction was 
u«d for the acquisition of the spectra shown, as it was found to 
«.bst.„«ally enhance tf« signal to noise ratio and/or signal intensity 
The equivalent flight pati, lengti, of the system is 1 .7 m, the base 
pressure is 10- Pa. Ions were detected witi, a discrete dynode 
secondary-electron multiplier (R2362. Hamamatsu Photonics., equipped 
. wm, a conversion dynode for effective detection of high mass ions The 
tetal .mpact energy of ti,e ions on ti,e conversion dynode was adjusted 
to values ranging from 1 6 to 25 keV, depending on «,e mass to be 

20 Lrr*"!' ^"^'"^ <" "he SEM was digitized by a 

20 UCroy 9450 transient recorder (LeCroy, Chesti,ut Ridge, NY, USA, Xitf. 
•sampling „,e of up to 400 MHz. For storage and further evaluation 
ti.. data were transfer^d to a personal computer equipped witi, custom- 
made software .OLISSES,. Ail spectra shown were taken in «,e positive 
.on mode. Between 20 and 30 single shot spectia were averaged for 
«<is> each of the spectra shown. 
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RESULTS 

Specmcfty of 8nases. Combining base-specific RNA cleavage with 
MALDI-MS requires reaction conditions optimized to retain the activity 
and specificity of the selected enzymes on the one hand and complying 
5 with the boundary conditions for MALDI on the other. Incompatibility 
mainly results because the alkaline-ion buffers, commonly used in the 
described reaction, such as Na-phosphate, Na-citrate or Na-acetate as 
well as EDTA interfere with the MALDI sample preparation; presumably 
they disturb the matrix crystallization and/or analyte incorporation. Tris- 
10 HCI or ammonium salt buffers, in contrast, are MALDI compatible 

(Shaler, T. A. eta/. (1996) ADaLChem. 68:576-579). Moreover, alkaline 
salts in the sample lead to the formation of a heterogenous mixture of 
multiple salts of the analyte, a problem increasing with increasing number 
of phosphate groups. Such mixtures result In loss of mass resolution and 
15 accuracy as well as signal-to-noise ratio (Uttle, D. P. et a/. (1 995) Proc_ 
Natl. Acad. Sci MSA ^2:231 8-2322; Nordhoff, E./Cramer, R. Karas, 
M..Hillenkamp, F., Kirpekar, F., Kristiansen, K. and Roepstorff, P (1993) 
Nac/e/c Acids Res., 21, 3347-3357). Therefore, RNase digestions were 
earned out under somewhat modified conditions compared to the ones 
20 described in the literature. They are summarized above in table VII. For 
Rnase T„ A, CL, ad Cusativin, Tris-HCI (pH 6-7.5) was used as buffer. 
20 mM DAC provides the pH of 5, recommended for maximum activity 
of RNases U, and PhyM. The concenti^ation of 10-20 mM of these 
compounds were found to not interfere significantiy with the MALDI 
25 analysis. To examine the specificity of the selected ribonucleases under 
these conditions, three synthetic 20-25mer RNA molecules with different 
nucleotide sequences were digested. 
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The MALDI-MS spectra of Figure 77 shows five different cleavage 
patterns (A-E) of a 25 nt RNA obtained after partial digestion with 
RNases T,. Uj, PhyM, A, and alkaline hydrolysis. These spectra were 
taken from aliquots which were removed from the assay after empirically 
5 determined incubation times, chosen to get an optimum coverage of the 
sequence. As the resulting samples were not fractionated prior to mass 
spectrometric analysis, they contain all fragments generated at that time 
by the respective RNases. In practice, uniformity of the cleavages, can 
be affected by a preferential attack on the specific phophodiester bonds 
10 (Donis-Kelier, H., Maxam, A. M., and Gilbert, W. (1977) Nucleic Acids 
Res., 4, 1957-1978; Donis-Keller, H. (1980) Nucleic Acids Res., 3133- 
3142). The majority of the expected fragments are indeed observed in 
the spectra. It is also worth noting that for the reaction protocols as 
used, correct assignment of all fragment masses is only possible. If a 2', 
15 3'-cyclic phosphate group is assumed. It is well known that such cyclic 
phosphates are Intermediates in the cleavage reaction and get hydrolyzed 
in a second, independent and slower reaction step involving the enzyme 
(Richards, F. M., and Wycoff, H. W. in The Enzymes Vol. 4, 3rd Ed., (ed. 
Boyer, P.D.) 746-806 (1971. Academic Press, New York); Heinemann, U 
20 and W. Saenger (1 985) Pure Appl. Chem. 57. 41 7-422; Ikehara. M. et 
al.. (1987) Pure Appl. Chem. 59-965-968) Vreslow, R. and Xu, R. (1993) 
Proc. Nat. Acad. Sci. USA, 90. 1201-1207). In a few cases different 
fragments have equal mass of differ by as little as 1 Dalton.. In these 
cases, mass peaks cannot unambiguously be assigned to one or the 
25 other fragments. Digestion of two additional different 20 nt RNA 

samples was, therefore, perfonned (Hahner. S.. Kirpekar. F.. Nordoff, E., 
Kristiansen. K.. Ro pstorff, P. and Hillenkamp, F. (1996) Proceedings of 
the 44th ASMS Conference on Mass Spectrometry. Portland. Oregon) in 
order to s rt ut th se ambiguities. For all samples tested, the selected 
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nbonucleases appear to cleave exclusively at the specified nucleotides 
leading to fragments arising from single as well as multiple cleavages 

Jn Rgure 77, peaks, indicating fragments containing the original 5' 
temiinus, are marked by arrows. All non marked peaks can be assigned 
to internal sequences or those with retained 3'-terminus. For a complete 
sequence all possible fragments bearing exclusively either the 5'- or the 
3'-tenninus of the original RNA would suffice. In practice, the 5'- 
fragments are better suited for this purpose, because the spectra 
obtained after incubation of all three synthetic RNA samples contain the 
nearly complete set of originals of 5'-ions for all different RNases 
(Hahner, S., KIrpekar, F., Nordoff, E., Kristiansen, K., Roepstorff and 
H.llenkamp, F. (1 996) Proceedings of the 44th ASMS Conference on 
Mass Spectrometry. Portland, Oregon). Internal fragments are somewhat 
less abundant and fragments containing the original 3'-terminus appear 
suppressed in the spectra. In agreement with observations reported in 
the literature (Gupta, R. C. and Randerati,, K. (1977) Nucleic Acids Res 
4, 1 957-1 978), cleavages close to the 3'-terminus were partially 
suppressed in partial digests of the RNA 25 mer by RNase T, and U 

<--'ftheyareintemalorcontaintheoriginal5'-terminus).'Fragments 
from such cleavages appear as weak and pooriy resolved signals in the 
mass spectra. 

For larger RNA molecules secondary structure is known to 
influence the uniformity of the enzymatic cleavages (Donis-Keller H 
Maxam, A. M. and Gilbert, A. (1977) AiuCeic Acids Res. 8, 3133-3142) 
Th,s can, in principle be, overcome by altered reaction conditions In 
assay solutions containing 5-7 M urea, ti,e activity of RNases such as 
T,, Uj, A, CI3, and PhyM is known to be retained (Oonis-K Her H 
Maxam, A. M. and Gilbert, W. (1977) A/ucieic Acids Res.. 4 2527-2537. 
eoguski, M. S., Hieter, P.A., and L vy, C. C. (,980) J. Bio,. Chem..2!SS. 
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2160-2163; Donls-Keller, H. (1980) Nucleic Acids Res '., 8, 3133-3142, 
while RNA iis sufficiently denatured. UV-MALDI-analysis with 3-HPA as 
matrix is not possible under such high concentrations of urea in the 
sample. Up to a concentration of 2 M urea in the reaction buffer, MALOi 
5 analysis of the samples was still possible although significant changes in 
matrix crystallization were observed. Spectra of the RNA 20 mer 
(sample B), digested in the presence of 2 M urea still resembled those 
obtained under conditions listed in Table VII. 

Digestion by RNases which exclusively recognize one nucleobase 
10 is desirable to reduce the complexity of the fragment patterns and 

thereby facilitate the mapping of the respective nucleobase. RNases CL^ 
and cursavitin are enzymes reported to cleave at cytidylic acid residues. 
Upon limited RNase CL3 and cursativin digestion of the RNA-20mer 
(sample B) under non-denaturing conditions, fragments corresponding to 
15 cleavages at cytidylic residues were indeed observed (Rgure 78). Similar 
to the data reported so far (Boguski, M. S., Hieter, P. A. and Levy, C. C. 
(1980) J. Biol. C/je/n.,255, 2160-2163: Rojo, M. A., Arias, F. J., 
Iglesias, R., Ferreras, J. M., Munoz, R., Escannis, C, Soriano, F., Llopez- 
Fando, Mendez, E., and Girbes, T. (1994) Pianta. 194, 328-338). The 
20 degradation pattern in Figure 78, however, reveals that not every 
cyti'dine residue is recognized, especially for neighboring C residues. 
RNase CL3 is also reported to be suscepti'ble to the influence of 
secondary structure (Boguski, M. S., Heiter, P. A., and Levy, C. C. 
(1980) J Biol. Chem., 255, 2160-2163), but for RNA of the size 
25 employed in this study, such an influence should be negligible. 

Therefore, unrecognized cleavage sites in this case can be attributed to 
a lack f specificity f this nzyme. To confirm these data, a further 
RNase CLs-digestion was performed with the RNA 20mer (sample C). As 
a result of the sequence of this analyte, all three linkages containing 
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cytldylic acid were readily hydrolyzed, but additional cleavages at uridylic 
acid residues were detected as well. Since altered reaction conditions 
such as increased temperature (90oc), various enzyme to substrate 
ratios, and addition of 2M urea did not result in a digestion of the 
5 expected specificity, application of this enzyme to sequencing was not 
pursued furtfier. Introduction of a new cytidine-speclfic ribonuclease 
cusativin, isolated form dry seeds of Cacom/s sativas L. looked promising 
for RNA sequencing (Rojo, M. A., Arias, F. J., Iglesias,. R., Ferreras, J 
M., Munoz, R., Escarmis, C, Soriano, F., Uopez-Fando, J., Mendez E 
10 and Girbes, T. (1994) PJanta. 194, 328-338). As shown in Rgure 78 
not every cytidine residue was hydrolyzed and additional cleavages 
occurred at uridylic acid residues for tiie recommended concentration of 
the enzyme. RNases CL3 and cusativin will, tiierefore not yield tiie 
desired sequence information for mapping of cytidine residues and their 
15 use was not furtiier pursued. The distinction of pyrimidine residues can 
be achieved, however, by use of RNases witii multiple specificities, such 
as Physarum polycephalum RNase (cleaves ApN, UpN) and pancreatic 
RNase A (cleaves UpN. CpN) (see Figure 77). All 5'-terminus fragments 
generated by tiie monospecific RNase U, and apparent in the spectrum of 
20 Rgure 77C were also evident in the spectrum of tiie RNase PhyM digest 
(Figure 77D). Rve of tiie six uridilic cleavage sites could, tiiis way, be 
uniquely identified by tills Indirect metiiod. In a next step, ti,e 
knowledge of tiie uridine cleavage sites was used to identify sites of 
cleavage of cytidilic acid residues in tire spectrum recorded after 
25 incubation witfi RNase A (Figure 77E), again using exclusively ions 

containing tfie original 5'-terminus. Two of tfie four expected cleavage 
sit s w re identified tiiis way. A f w imitations are apparent fr m tfi se 
specti^a, if only tiie fragments containing the original S'-tenninus are used 
for tiie sequence determination. The first two nucleotides usually escape 
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the analysis, because their signals get lost In the low mass matrix 
background. Because of this, the corresponding fragments are missing in 
the spectra of the U- and C-speclfic cleavages. Large fragments with 
cleavage sites close to the 3'-terminus are often difficult to identify, 
5 particularly in digests with RNases r- and U^. because of their low yield 
(vide supra) and the often strong nearby signal of the non-digested 
transcript. Accordingly the cleavages in position 22 and 23 do not show 
up in the spectrum of the G-specific RNase T, (Figure 77A) and the 
cleavage site 24 cannot be identified from the spectra of the and 
10 PhyM digests (Rgures 77 C and D). Also site 1 6 and 17 with two 

neighboring cytidilic acids cannot be identified in the RNase A spectrum 
of Figure 77E. These observations demonstrate that a determination of 
exclusively the 5'-terminus fragments may not always suffice and the 
information contained in the internal fragments may be needed for a full 
15 sequence analysis. 

Finally, limited alkaline hydrolysis provides a continuum of 
fragments (Figure 77B), which can be used to complete the sequence 
data. Again, the spectrum is dominated by ions of fragments containing 
the 5'-terminus, although the hydrolysis should be equal for all 
20 phosphodiester bonds. As was true for the enzymatic digests, con-ect 
mass assignments requires one to assume that all fragments have a 2', 
3'-cyclic phosphate. The distribution of peaks, therefore, resembles that 
obtained after a 3'-exonuclease digest (Pieles. U., Zurcher, W., Schar, M. 
and Moser, H. E., (1993) Nucleic Acids Res. , 21, 3191-3196; Nordhoff, 
25 E. et aL (1 993) Book of Abstracts, 1 3* Internet. Mass Spectrom. Conf ., 
Budapest p. 218; Kirpekar, F., Nordhoff, E., Kristiansen, K., Roepstorff, 
P., Lezius, A. Hahn r, S., Karas, M. and Hillenkamp. F. (1994) Nucleic 
Acid Fes., 22, 3866-3870). In principle, the alkaline hydrolysis alone 
could, th refer , be used for a complete sequencing. This is, however. 
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on.y possible for quite smal. ongoribonucleotides, because larger 
fragrnent ions, dlfferir^g ir, mass by only a few mass units wH, not be 
resolved In tbe spectra and tbe mass of larger Ions cannot be deterged 
w.th ^e necessary accuracy of better than , Da. even If peaks are 

-----"—ed. THelnterpreta.onoftbespectr:part::3Hy 
frorn d.gests of unknown RNA samples Is substantially simplified. If 

to the mass spectrometric analysis. A procedure for this approach Is 
descnbed In the following section. 
10 Sep.r.tlonof5'.tforiny,ste^fr,^^,„,,, Streptavldln-coated 

magnefc beads (Dynal, were tested for the extraction of fragments 
conta.n,ng the original 5'-terminus from ths digests Mainr f T 

ana, Tr"" '"'^ ""^ -u.a,e. 1-7^^, 

»ep«v,d.„-.,„«„ ,„.e.3c«„„. *e inuc. co.p,ex was no, ,„und in «,e 
*«ALD, specua. Instead, signals o, «,e monon,eric subunN of 
strop^vidln and *e biotinylated DNA were de.ec«d. Whether .he 

MAU,I desorption process. Is no. su,rls,„«„. „ s.ep.!:dln 

s .™n„h,l.zed on a solid surface such as ™.,„e«c heads. «,e sal 

aB were h ^ """"^'^ "''^ "« ^ 

ZZZT: ~-ed and care Jiv 

washed before ,ncubab„n In «,e 3-HPA MALO, „,aMx. No analy.e 
s.9nals could be ob«l„ed from ««se samples. To .es, whett.er *e 

b.o«nylated species had been bound tttf,e beads el.oge*er ,„.• , 
*e ex.rac.ed and wash d beads was perf ,m«d bv heC a. sCc ^ 
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the presence of 95% formamide. This procedure is expected to denature 
the streptavidin, thereby breaking the streptavidin/biotin complex. Rgure 
79B shows the expected signals of the two biotinylated species, proving 
that release of the bound molecules in the MALDI process is the problem 
5 rather than the binding of the beads; Figure 79A shows a spectrum of 
the same sample after standard preparation, showing signals of all four 
analytes as a reference. Complete removal of the formamide after the 
elutlon and prior to the mass spectrometric analysis was found to be 
important, otherwise crystallization of the matrix is disturbed. Mass 
1 0 resolution and the signal-to-noise ration In spectrum 79B are comparable 
to those of the reference spectrum. These results testify to the 
specificity of the streptavidin-biotin interaction, since no or only minor 
signals of the non-blotlnylated analyte were detected after incubation 
with the Dynal beads. Increased suppression of nonspecific binding was 
1 5 reported through an addition of the detergent Tween-20 to the binding 
buffer (Tong, X. and Smith, L. M. (1992) Anal Chem., 64. 2672-2677). 
Although this effect could be confirmed in this study, peak broadening 
'affected the quality of the spectra due to remaining amounts of the 
detergent. The necessity of an elutlon step as a prerequisite for 
20 detection of the captured biotinylated species can be attributed to a 

stabilizing effect of the complex by the immobilization of the streptavidin 
to the magnetic beads. 

For practical application of tills solid phase metiiod to sequencing 
a maximum efficiency of binding and elutlon of biotinylated species is of 
25 prime importance: Among a variety of conditions investigated so far, 
addition of salts such as EDTA gave best results in the case of DNA 
sequencing by providing ionic str ngtii to tiie buffer (Tong, X. and Smith, 
L.M. (1992) >»/,a/C/,e/n.,64, 2672-2677). To examine such an effect 
on the s lid-phase method, several salt additives were tested for the 



PCr/(JS97/20444 



•184- 



blnding am elution of the S'-biotinylated RNA in w,^ ^ 

The resuH. are shown in 80. Ju^^ZZI Z T^"'" 

s<9na,..o-„oise ra«o„. and reso.u«oo of rLtl! 

5 e'«c.en„or«,ebindrng/e.u«on. Since COTA al " h T"'' 
'or .iva,en. ca«on. fe„na«on of proper aeco„C an IT""'" '"^' 
*e RNA is prevented. An improved se„si«vitv 1" 
has been demonstrated under soch condition^foTl! T 
samples hv eieotrospra, „ass spectro^etrvTJlal Ta T 
10 and MoCioskev. J.A. ,,996, * • 

THe i^prove^nt in the MACO, anaivsis is ^^ 'l'''' 
compared to the spectra,™ obtained for «,e soiuul ^ 
a.one ,R,„re 8,h,. hut «,e reproduoihiiitv Cl ir 7 "^"^"^ 
-bstan^aiiV .n^roved for the COTA/fonnamide^ 11 "ZT"'^ 
'B add-on to the improved hinding/eiudon. d,is additivTravr 
*e .ncorporation of the anaiyte into the matrix o^L u 
-iicina sisna, hroadenin, on the hi.h mass side waTchs " 
. formamide so>u«o„s containin, EDTA, COTA or 2^ 
. hydroxide. Since this effect is mo« „ anvnonium 
sn ^ prominent in case of piiv 

20 ammonium hydroxide and this .gen. was aiso used ,Z 1 . 

ca be assumed. ^ a<iduct ion formation 

The applicability of streptavidin-coated • 
.o RNA sequencing was -emons.ra.ed: reZ e J d^"^ ~ 
25 hio«ny.a.ed RNA * v*ro .anscrip. ,49 nt, (FiguTe r,, ' 
*re9ment pattern obtained after incubahon IS^^ 
^ectrum 8,A. Separation of the biotinyiated fraom " " 

compiexity Of the spect^m ,.gure 8,3,^^ ^^JT, : 
- captured by the beads, .e signaie in the spe:::^ ^rd""^ 
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and the increased number of signals In the low mass range indicate that 
even after stringent washing of the beads, some amounts of buffer and 
detergent used for the binding and elution remained. Further 
improvements of the method are, therefore, needed. Another possible 
5 strategy for application of the magnetic beads is the immobilization of the 
target RNA prior to RNase digestion by an elution of the remaining 
fragments for further analysis. Qeavage of the RNA was impeded in this 
case, as evidenced by a prolonged reaction time for the digest under 
otherwise identical reaction conditions. 

EXAMPLE 22 

Parallel DNA Sequencing Mutation Analysis and iUficrosateinte Analysis 
Using Pnmers with Tags and Mass Spectrometric Detection 

This EXAMPLE describes specific capturing of DNA products 

generated in DNA analysis. The capturing is mediated by a specific tag 

15 (5 to 8 nucleotides long) at tiie 5' end of tiie analysis product that binds 

to a complementary sequence. The capture sequence can be provided 

by a partially double stranded oligonucleotide bound to a solid support. 

Different DNA analysis (e.g., sequencing, mutation, diagnostic, 

microsatellite analysis) can be earned out in parallel, using, for example, 

20 a conventional tube or microtiter plate (MTP). The products are then 

specifically captured and sorted out via tiie complementary identification 

sequence on tiie tag oligonucleotide. The capture oligonucleotide can be 

bound onto a solid support (e.g., silicon chip) by a chemical or biological 

bond. Identification of tiie sample is provided by the predefined position 

25 of tiie capute oligonucleotide. Purification, conditioning and analysis by 

mass spectrometry are done on solid support. This method was applied 

for capturing sp cific primers that had a 6 base tag sequence. 

MATERIALS AND METHODS 

Genomic DNA. 
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f^fi Amplification 



PGR amplifications of part of th#» 

reverse pH™. The .o.a. PCR ^0.! ^'^^ -^^ « 
flenomio ONA. ,U Ta<,-po,y™erase 

'277049,. an. ,0 p.„, each JZ i^^'T 
10 9lob,„ ,e„e was a.p„ed us,„, L : """"" *^ 

- @ a«. a fi„„ ax,e„s,o„ „, , I ^.^i^' Zr 

IB *eparifled product was used, or «,epri J o^ba °' 
. <PBOB« or scuencn, reac«on, respecZr 
Ay/wer o//<7o base extension (PROBFI »«w 

oe.ec.-ono,pu.«.e.«a«:„:ne:::r::::~' 

codon S and 6 and a, codon 30 and ,n «,e IvITh 
20 was done ,n parallel (FIGURE 82A, B-TAg! '"'"^"""'V- 
(GTCGTCCCATGGTeCACCTeACTC Seq. ID No 681 . 
~..lyze codon 6 and 6 and 6.TAG2 (CGCTgtGGT^! "'"^ 

.0. ^.. as, .r.. analyses .::zi7z:::iTr''' 
p^n^r .s P.O.,. ..AG. p.^r:::r —^.^^^^ 

cone ntration each 25 ddATP /fi„«i ' 

a- d-NTPs Pun^asedt^l ^ZTlrrT" ""^^ 

100333.. oMox .-r^oseorcr:!:!"::""^^ 
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ThermoSequenase (Amersham, CAT#E79000Y). The cycling program 
was as follows: 5 min @ 94"C, 30 sec @ B3°C, 30 sec @ 72«»C and a 
final extension step for 8 min @ 72«>C. Sequencing was performed 
under the same conditions except that the reaction volume was 25 /A 
5 and the concentration of nucleotides was 250 //M for ddNTP. 

Capturing using TAG sequence and sample preparation 
The capture oligonucleotides cap-tag 1 
d(GACGACGACTGCTACCTGACTCCA Seq ID No. 70) and cap-tag2 
d(ACAGCGGACTGCTACCTGACTCCA Seq ID No. 71), respectively, were 
10 annealed to equimolar amounts of uni-as d(TGGAGTCAGGTAGCAGTC 
Seq ID No. 72) (FIGURE 82A). Each oligonucleotide had a concentration 
of 10 pmoMfA in ddHjO and incubated for 2 min @ 80«»C and 5 min @ 
37«C. This solution was stored at -20«»C and aliquots were taken. 10 
pmol annealed capture oligonucletides were bound to ^0 /A paramagnetic 
15 beads coated with streptavidin (10 mg/ml; Dynal, Dynabeads M-280 
streptavidin Cat# 1 12.06) by incubation for 30 min @ 37«»C. Beads 
were captured and the PROBE or sequencing reaction, respectively, was 
added to the capture oligonucleotides. To facilitate binding of B-TAGl 
abd B-TAG2, respectively, the reaction was incubated for 5 min @ 25«'C 
20 and for 30 min @ 1 6oC. The beads were washed twice witii ice cold 
0,7 M NH4 Citrate to wash away unspecific bound extension products 
and primers. The bound products were dissolved by adding 1 p[ DDHjO 
and incubation for 2 min @ 65*»C and cooling on Ice. 0.3/il of the 
sample were mixed with 0.3//I matrix solution (saturated 3-hydroxy- 
25 picolinic add, 10% molar ratio ammonium-citrate in acetonitrile/water 
(50/50. v/v)) and allowed to air dry. The sample target was 
automatically introduced into the source region of an unmodified 
Perspective Voyager MALDI-TOF operated in delayed extraction linear 
mode with 5 and 20 kV on the target and conversion dynode. 
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respectively. Theoretical average molecular mass (M,(calcH were 

calculated from atomic compositions; reported expedmen,,, M.(M (exDi. 

values are those of the singly-pronated form. 
RESULTS 

S Specific capturing of a mixture of extension products by a short 

complementary sequence has been applied to isolate sequencing and 
pnmer digo base extension (Pnosa) products, m. method was used 
for the detection of putative mutations in tf,e human 8-globi„ gene a. 
codon 5 end 6 and at codon 30 and lVS-1 donor site; respectivelv 

and B1 1 . The amplification product was purified and ti,e nucleotides 
separated. One flfth of ti,e purified product was used for analyses by 
pnmer oligo base extension. To analyze bod, sites in a single reactiol 
pnme.. 8-TAG, and 6-TAQ2. were used respectively. .^TAG, binds 
u^eam of codons 5 and 6 and «-TAG2 upstream of codon 30 and the 
•VS., donor s«e. Extension of «,ese primers was perfo™,ed by cycling 
- *e presence of ddATP and dCTP. dGTP and dTTP, teading to specific 
products, depending on ti,e phenotype of ti,e individual. The reactions 
were ti,en mixed witi, tt,e capture oligonucleotides. Caphire 
Oligonucleotides include tiie biotinylated capti^re pnmer cap-tag, and 
cap-tag2, respectively. They have 6 bases at the 5' end «,at are 
complementary to ti» 5' end of 6-TAG, and B-TAG2. respectively 
T1.erefore. tf«y specificaUy capture these primers and ti,e extended 
products. By annealing a universal oligonucleotide ,uni-as) to the capture 

«ra«fed molecule where only the capture sequence stays single 
stiranded fPigur 82,. ™s molecule is ti,en bound to streptavWin coated 
paramagn tic particles, to which tt,e PROBE or sequencing reaction 
respectively is added, -n, mixture was washed to bind only the ' 
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speclfically annealed oligonucleotides. Captured oligonucleotides are 
dissolved and analyzed by mass spectrometry. 

PROBE products of one individual (Rg. 83) show a small peak with 
a molecular mass of 7282.8 Da. This corresponds to the unextended B- 
5 TAG1 that has a calculated mass of 7287.8 Da. The peak at 8498.6 Da 
corresponds to a product, that has been extended by 4 bases. This 
con-esponds to the wildtype situation. The calculated mass of this 
product is 8500.6 Da. There is no significant peak indicating a 
heterozygote situation. Furthermore only B-TAG1 and not B-TAG2 has 
10 been captured, indicating a high specificity of this method. 

Analyses of what was bound to cap.tag2 (Figure 84) shows only 
one predominant peak with a molecular mass of 9331 .5 Da. This 
corresponds to an extension of 8 nucleotides. It indicates a homozygous 
wildtype situation where the calculated mass of the expected product is 
15 9355 Da. There is no significant amount of unextended primer and only 
B-TAG2 has been captured. 

To prove that this approach is also suitable for capturing specific 
sequencing products, the same two primers 6-TAGl and B-TAG2, 
respectively, were used. The primers were mixed, used in one 
20 sequencing reaction and then sorted by applying the above explained 
method. Two different termination reactions using ddATP and ddCTP 
were perfonned with these primers (Figures 85 and 86, respectively). All 
observed peaks in the spectrograms correspond to the calculated masses 
in a wildtype situation. 

25 As shown above, parallel analysis of different mutations (e.g.. 

different PROBE primers) is now possible. Further, the described method 
is ultabi for capturing specific sequencing pr ducts. Capturing can b 
used for separation of different sequencing primers out of one r action 
tube/well, isolation of specific multipl x-amplified products, PROBE 
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products, etc. Conventional me«,„<,s. like cycle sequencing a™, 
conventional volumes can be used. A univereal chip design permits «. 
use o, many different applications, F„,tHer. this mett.„d can r 
automated for high throughput. 

5 

EXAMPLE 23 
Deletion Detection by Mass-Spectrometry 

Various formats can be employed for mass spectrometer det«of 
Of a de.et.on within a gene. For example, molecu Jmass ofTd k. 
standard amplified product can be determined or eith k 
10 strands of a double stranded product can ::tir:::r:Z 
measured as described in previous examples. 

Alternatively, as described herein, a specific en^matic reaction 

^otem^ed by mass spectrometry. Tl.e deletion size can be up to 
detection of the wildtypc and mutated aliele. By simultaneous detection 

^e tm::::" '^—^ - - ^ ~ 

20 MATERIALS AND METHODS 

Genomic DNA 

individu"!"^^ """"^ """^ """"^ '^o- ---v 

PCR amplification 

tou. ^'""*'"'°"'°"°'*''='«^'°'^^™"«»'>'-hed and optimized 
to u^ *e reaction products witi,ou. a h,m,er purification step for 
capn.nng v«ti, sti ptavidin coated beads. THe primers for targe. 
ampl.fication and for PROBE reactions were as follows- 
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CICRA.F:d(CAG CTC TCA TTT TCC ATA C SEQ ,D NO 73, w 
CKRA-R «o: <«AGC CCC AAO ATG ACT ATC S6Q ,0. NO 74, 'kR . 

62 C. 6 seconds @ 72.C. and a «„a, e«e„,i,„ o, s „i„«es a, 72 c 
The „„a, vo,u„« was 50^ inCudln^ 200 „g genomic DNA ,u Tea 
polymerase (Boehringer^annheim. Ca, * ,696594, , 6 M 
Mm DNTPS (eoehrinaer-Mannheim. Ca. * ,277049 o „ . ^ ^ ° ' 

'°'^P°™'"»9"™"<= beads coated with streptavidindo mo, . 
Oyna,. Dv™be«.s M.280 streptavldin Cat * ,,2.06, T,!^ ' 

(6M NH.C.. 0.3 M NH.OH, were added to <Z ^T^. 
M of PCR reecaon were saved for a.ecu,ph„res,ll IfterTn " 
incubation for 30 min at 37»c '^'^>- After brndrng by 

IB . , " ^ ^upe^natent was discarded r.„. 

IB templates were denatured with 50 ^ of ,00 Mm NaOH foT^ 

.ambient temperature, washed once with 50M 50 Mm N.^ " 

served as templates for PROBE reactions. 

DNTPS/ddntp ach SO irfy/l and O 5 ,.1 Q« ^l»TT, 3.5//I 

• . ^/nvi ana 0.5 //I Sequenase (0.8 U) were aHw^w . 

incubated at 37 C for 10 min. 

r-^ Treatment of DNA 
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To generate blunt ended DNA, amplification products were treated 
with T4 ONA polymerase (Boehringer-Mannheim Cat# 1004786) The 
reactions were earned out according to the manufacturer's protocol for 
20 min, at 1 1 «»C. 

5 Direct Size Determination of Extended Products 

To detennine the size of the amplified product, MALDI-TOF was 
applied to one strand of the amplification product, samples were bound 
to beads, as described above, conditioned and denatured, as described 
below. 

10 DiS/A Conditioning 

After the PROBE reaction the supernatant was discarded nd the 
beads were washed first In 50 //I 700 mM NH,-citrate and second 50^ 
SO mM NH,-citrate. The generated diagnostic products were removed 
for the template by heating the beads in 2 //I H^O at 80«»C for 2 min. 
15 The supernatant was used for MALDI-TOF analysis. 

Sampie Preparation and Analysis witii MALDI-TOF Mass 
Spectrometry 

Sample preparation was performed by mixing 0.6 //I of matrix 
solution (0.7 M 3-hydroxypicolinic acid, 0.07 M dibasic citi^ate in 1 1 
20 H,0:CH3CN) with 0.3 ,A of diagnostic PROBE products in water on a 
sample target and allowed to air dry. Up to 100 samples were spotted 
on a probe target disk for introduction into the source region of an 
unmodified Perspective Voyager MALOI-TOF instrumem operated in 
linear mode with delayed extraction and 5 and 30 kV on the target and 
25 conversion dynode, respectively. Theoretical average molecular mass 

(M,(calc)) of analytes were calculated from atomic compositions, reported 
experimental M,(H(exp)) values are ti,ose of tiie singly-pronated form 
detennined using internal calibration with unextended primers in the case 
of PROBE reactions. 
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Conventional Analyses 

Conventional analyses were performed by native ooi.» . 
Oe. electrophoresis according to standard protocor rrj^'"^^'^^^^^ 
-products were denatured with formamide prior to oa Jn 
and sta^e^ with ethidiu. hromide or sij, rZr ^ ^ 



The CKR-5 status ot 10 randomly chosen ONA , 
indlvldoeb wore analyzed. Leukocyte ONA Z ,2,^7. 
^Uouot o, the an^imed product warena"!:;; r„d , " ^" 
10 «e. el^^opho^s. and sHye. staWn« o, L 0^1^ rr^' 

CKR-5, whereas the other 6 samples showed one h»„w 
a homozy^us «e„e ,P„„r. 88,. In the case lerer; ^ 
Observed, they corraspond to the expected s,» 077^1 , 
' = .ene and 43 hp „r the al.le with the delete: ^ J, 

hand was ohserved. the ^ was ahou. 7S hp whL i^^ra ""^ 
homozygous wildtype CKR-S allele. One DNA sample , 
presumably heterozygous one from a homozygous ,nd -hT " 
for all norther analysis. To dete„.lne the JZl^^'l 
ao product. DNA wa, suhiected to matHx assist" ' ^'"^ 

dosorption/ionizaaon coupled witt, time o, flight analysis (MALDI TOH 
Double stranded ONA, bound to streptavidin coa.-H '"'^'^"0^. 
particles, was denatured and the strand rei asL :! T""'"'"" 

alyzed. SSA shows a spe^^h 07, o^'T"' 

^ was supposed to be heterozygous according to 1 iTl .^^ " 
polyacrylsmide gel electrophoresis (Figure 88. , " 

- sense strand ,or a .Idtype J^ T^Z oT^TZ "^'^ 
atrand carrying the deletion allele ,3,43 (Rgure 87 a„I r t 
™nv «,em,osta.e polymerases ^^IZ: rj^Jl T 
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3' end of the product, those masses were also calculated. They are 
23349 and 13456 Da. The masses of the observed peaks (Figure 89A) 
are 231 19 Da. which corresponds to the calculated mass of a wildtype 
DNA strand where an adenosine has been added (23349 Da) Since n 
5 peak with a mass of about 23036 Da was observed, the polymerase 
must have qualitatively added adenosine. Two peaks, which are close to 
each other, have a mass of 13451 and 13137 Da. This corresponds to 
the calculated masses of the allele, with the 32bp deletion. The higher 
mass peak corresponds to the product, where adenosine has been added 
and the lower mass peak to the one without the unspecific adenosine 
Both peaks have about the same height, indicating tiiat to about half of 
the product adenosine has been added. The peak witi, a mass of 1 1682 
Da ,s a doubly charged molecule of the DNA corresponding to 23319 Da 
(2 X 1 1 682 Da = 23364 Da). The peaks with masses of 6732 and 
15 6575 Da are doubly charged molecules of the one witii masses of 13451 
and 13137 Da and the peak with 7794 Da corresponds to ti,e triply 
charged molecule of 2331 9Da. Multiple charged molecules are routinely 
•dentified by calculation. Amplified DNA derived from a homozygous 
.ndn,idual shows in ti.e spectrograph (Figure 89C, one peak witii a mass 

23349.6 and a much smaller peak witf, a mass of 23039.9 Da The 
higher mass peak corresponds to DNA resulting from a wildtype allele 
wti, an added adenosine, tfiat has a calculated mass of 23349 Da The 
lower mass peak corresponds to the same product witiiout adenosine 
Three furtfier peaks witii a mass of 1 1^86, 7804.6 and 5852 5 Da 
25 correspond to doubly, triply and quadruply charged molecules. 

The unspecific added adenine can be removed from the 
amplified DNA by tieatinent of the DNA and T4 DNA polymerase DNA 
derived from a heterozygous and a homozygous individual was analyzed 
after T4 DNA polymerase treatinent. Figure 898 shows th spectrograph 
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derived from heterozygous DNA, The peak corresponding to the 
wildtype strand has a mass of 23008 Da indicating that the added 
adenine had been removed completely. The same Is observed for the 
strand with a mass of 13140 Da. 
5 The other three peaks are multiply charged molecules of the parent 
peaks. The mass spectrograph for the homozygous DNA shows one 
peak that has a mass of 23004 Da, corresponding to the wildtype DNA 
strand without an extra adenine added. All other peaks are derived from" 
multiply charged molecules of this DNA. The amplified products can be 
10 analyzed by direct determination of their masses, as described above, or 
by measuring the masses of products, that are derived from the amplified 
product in a further reaction. In this "primer oligo base extension 
(PROBE)- reaction, a primer that can be internal, as it is In the nested 
PCR, or identical to one of the PGR primers. Is extended for just a few 
15 bases before the termination nucleotide is incorporated. Depending on 
the extension length, the genotype can be specified. CKRA-F was used 
as a PROBE primer, and dATP/dGTP and ddTTP as nucleotides. The 
primer extension is ACT in case of a wildtype template and AT in case of 
the deletion (Rgure 87). The corresponding masses are 6604 Da for the 
20 wildtype and 6275 Da for the deletion, respectively. PROBE was applied 
to two standard DNAs. The spectrograph (Figure 90A) shows peaks 
with masses of 6604 Da corresponding to the wildtype DNA and at 6275 
Da corresponding to the CKR-5 deletion allele (Table VIII). The peak at a 
mass of 5673 Da corresponds to CKRA-F (calculated mass of 5674 Da). 
25 Further samples were analyzed in analogous way (Figure 908). It is 
unambiguously identified as homozygous DNA, since the peak with a 
mass of 6607 Da corresponds to the wildtype allele and the peak with a 
mass of 5677 Da to the unextended primer. No further peaks were 
observed. 
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The example demonstrates that deletion analysis can be performed 
by mass spectrometry. As shown herein, the deletion can be analyzed 
by direct detection of single stranded amplified products, or by analysis 
of specifically generated diagntic products (PROBE). In addition, as 
shown in the following Example 26, double stranded DNA amplified 
products can be analyzed. 



1 

1 wildtype w/o A 
1 wildtype with A 
1 deletion w/o A 
1 deletion with A 


Calculated Mass 

23036 

23349 

13143 

13456 


Measured Mass 

23039/23009/23004 
23319/23350 
13137/13139 
13451 


PROBE 






wildtype 


6604 


6604/6608 


deletion 


6275 


6275 







15 



20 



All masses are in Dalton. 

EXAMPLE 24 

Pentapiex tc-PRQBE 
SUMMARY 

The multiplexing of thermocycling primer oligo base extension (tc- 
PROBE) was performed using five polymorphic sites in three different 
apolipoprotein genes, which are thought to be involved in the 
pathogenesis of atherosclerosis. The apolipoprotein A IV gene (codons 
347 and 360), the apolipoprotein E gene (codons 112 and 158), and the 
25 apolipoprotein B gene (codon 3500) were examined. All mass spectra 
were easy to interpret with respect to the five polymorphic sites. 
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MATERIALS AND METHODS 
PGR Amplification 

Human leukocytic genomic DNA was used for PGR. Usted below 
are the primers used for the separated amplification of portions of the 
5 Apo A IV, Apo E and the Apo B genes: 

Apo A IV: A347F: 5'-CGA GGA GCT CAA GGC CAG AAT 3' 

(SEQIDNO. 75) ^ ^i-<i 

A360 R-2-bio: 'S'-CAG GGG CAG CTC AGC TCT C-3- 

(SEQ ID NO. 76) . 

Apo E: ApoE-F: 5'-GGC ACG GCT GTC CAA GGA-3' 

(SEQ ID NO. 77) 

ApoE-R bio: •5'-AGG CCG CGC TCG GCG CCC TC 3' 

(SEQIDNO. 78) ^v.^, d 

10 ApoB: ApoB-F2bio: -S'-CTT ACT TGA ATT CCA AGA GC-3' 

(SEQ ID NO. 79) 

Apo B^: 5'-GGG CTG ACT TGC ATG GAC CGG A-3' 

(SEQ ID NO. 80) ^ 

* biotinylated 

Taq polymerase and lOx buffer were purchased from Boehringer- 
15 Mannheim (Germany) and dNTPs for Pharmacia (Freiburg, Germany). 
The total PGR reaction volume was 50 ^ including 10 pmol of each 
primer and 10% DMSO (dimethylsulf oxide, Sigma) (no DMSO for the 
PGR of the Apo B gene), with ~200 mg of genomic DNA used as 
template and a final dNTP concentration of 200 a/M. Solutions were 
20 heated to 80«»G before the addition of 1 U Taq polymerase; PGR 

conditions were: 5 min at 95<»C, followed by 2 cycles 30 sec 940C 30 
sec 62«G, 30 sec 72<»G, 2 cycles 30 sec 94«»G 30 sec 58oc, 30 sec 
72«G, 35 cycles of 30 sec at 94<»G, 30 sec at 56'G, 30 sec at 72«»G 
and a final extension time of 2 min at To remove unincorporated 

25 primers and nucleotides, amplified products were purified using the 

"QIAquick- (Qiagen, Gemnany )kit. with elution of the purified products 
in 50//L of TE buffer (10 mM Tris-HGI, ImM EDTA, pH 8.0). 
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Binding of the amplified product on beads 
10 //I of each purified amplified product was bound to 5//I 
DynaBeads (Dynal, M-280 Streptavidin) and denatured according to the 
protocol from Dynal. For the pentaplex tc-PROBE reaction the three 
5 different amplified product (bound on the beads) were pooled. 
Tc-PROBE 

For the PROBE reaction the following primers were used: 
(Apo A) P347: 5'-AGC CAG GAG AAG-3' (SEQ ID NO. 81 ) 
(Apo A) P360: 5'-ACA GCA GGA ACA GCA-3' (SEQ ID NO 82) 
10 (Apo E) P1 12: 5'-GCG GAG ATG GAG GAG GTG-3' (SEQ ID NO 83) 
(Apo E) PI 58: 5'-GAT GCC GAT GAG CTG CAG AAG-3'(SEQ ID 
NO. 84) 

(Apo B) P3500: 5'-GTG CCC TGC AGC TTC ACT GAA GAC-3'(SEQ ID 
NO. 85) 

15 The tc-PROBE was carried out in a final volume of 25 fA containing 

10 pmol of each primer listed above, 2.5 U Themioquenase (Amersham) 
2-5 pL Thennoquenase buffer, and 50 //M dTTP (final concentrations) 
and 200^ of ddA/C/GTP, respectively. Tubes containing the mixture 
were placed in a thermocycler and subjected to the following cycling 
20 conditions: denaturation (94oC) the supernatant was carefully removed 
from the beads and 'desalted' by ethanol precipitation to exchange 
nonvolatile cations such as Na+ and K+ witi, NH, + . which evaporated 
dunng the ionization process; 5//L 3M ammonium acetate (pH 6 5) O 5 
M. glycogen (10 mg/mL. Sigma), 25^H,0, and 1 1 0 M- absolute ethanol 
25 were added to 25 pL PROBE supernatant and incubated for 1 hour at 
4<»C. After a 10 min. centrifugation at 13,000 X g, the pellet was 
washed in 70% etiianol and resuspended in 1 ^ 18 Mohm/cm H,0. A 
0.35 pL aliquot of resuspended DNA was mixed with 0.35 pL matrix 
solution (0.7 M 3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium 
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citrate in 1:1 H,0:CH3CN) on a stainless steel sample target disk and 
allowed to air dry preceding spectrum acquisition using the Thermo 
Bioanalysis Version 2000 MALDI-TOF operated in reflectron mode with 5 
and 20 kV on the target and conversion dynode, respectively. Theoretical 
average molecular masses (M,(calc)) of the fragments were calculated 
from atomic compositions. External calibration generated from synthetic 
(ATCG)„ oligonucleotide (3.6-18kDa) was used. Positive ion spectra 
from 1-37500 Da were collected. 
RESULTS 

Table VIII shows the calculated molecular masses of all 
possible extension products including the mass of the primer itself Rg 
91 shows a respective MALDI-TOP MS spectra of a tc-PROBE using three 
different templates and 5 different PROBE primers simultaneously in ne 
reaction. Comparison of the observed and calculated masses (see table 
15 VIII) allows a fast genetic profiling of various polymorphic sites in an 
individual DNA sample. The sample presented in Figure 91 is 
homozygous for threonine and glutamine at position 347 and 360 
respectively, in the apolipoprotein A IV gene, bears the epsilon 3 allele 
homozygous in the apolipoprotein E gene, and is also homozygous at the 
20 codon 3500 for arginine in the apolipoprotein B gene. 
TABLE VIII 



25 





f SEQ 10 


1 mass 


allele 


1 Apolipoprotein A IV 






R 5'-AGCCAGGACAAG-3' (347) 


86 


3688.40 


unextended 


1 5'-AGCCAGGACAAGTC-3' 


87 


4265.80 


primer 
347Ser 


1 5'-AGCCAGGACAAGA-3' 


88 


3985.60 


347Thf 


1 5'-ACAGCACCAACAGCA-3'(360) 


89 


4604,00 


unextended 
primer 


1 5'-ACAGCAGGAACAGCATC-3' 


90 


5181,40 


360HIS 
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SEQ ID I mass 



15 



1 5*-ACAGCAGGAACAGCAG-3' (112) 


91 


1 4917.20 


1 360Gln j 


1 ^ Apolipoprotetn E 






1 5'-GCGGACATGGAGGACGTG-3' (112) 


92 


5629.60 


unextended I 
primer | 

112Cys 1 


1 5'-GCGGACATGGAGGACGTGGC-3' 


93 


6247.00 


1 S'-<SCGGACATGGAGGACGTGC-3' 

1 5'-GATGCCGATGACCTGCAGAAG.3'(158) 


94 
95 


5902.80 
6489.20 


112Arfl 1 
unextended 11 


5'-GATGCCGATGACCTGCAGAAGC-3^ 
5*-GATGCCGATGACCTGCAGAAGTG-3' 


96 
97 


6753.40 
7097.60 


primer D 
ISSArg ^ 
1 58Cys II 



Apolipoprotein B-10 0 

S'-GTGCCCTGCAGCTTCACTGAAGAC-3' 
(3S00) 

5'-GTGCCCTGCAGCTTCACTGAAGACTG-3' 
5'-GTGCCCTGCAGCTTCACTGAAGACC-3' 



98 


7313.80 


99 


7931.20 


100 


7587.00 



unextended 
primer 

3500Gln 

3500Arg 



20 



25 



EXAMPLE 25 

S^^^^l^""" "^^ '^•^ "VWLDr-TOF M^ss 

MATERIALS & METHODS 

Thirty-five cycles of PGR reactions were performed .n a 96 weH 
microliter plate with each well containing a total volume of 50 //I 
.ncluding 200 ng genomic DNA, 1 unit Taq DNA polymerase, 1.5 mM Mg 
CI., 0.2mM dNTPx, 10 pmol of the forward primer and 6 or 8 of the 
biotmylated reverse primer. The sequences of PGR primers prepared 
according to established chemistry (N.D. Sinha, J. Biemat H Kter 
ISfiShedOetL 24:5843-5846 (1983, are as follows: exon 5:d(biotin- 
TATGTGTTGAGTTGTGGGG SEQ ID NO. 101) and d(biotin- 
GAGAGGCGTGGGGACCCTG SEQ ID NO. 102); exon 6: 
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D(ACGACAGGGCTGGTTGCC SEQ ,D NO. 103, and d(blotin- 
ACTGACAACCACCCTTAAC SEQ ID NO. 104,; axon 7- 

SEQ ,D NO. 105, and d(blotin- 
CACAGCAGGCCAGTGTGC SEQ ID NO. 106; axon 8- 

5 ^|^«QACCTGATTTCCTTACTGSEQIDNO.l07,an^ 
TGAATCTGAGGCATAACTG SEQ ID NO. 108,. 

To each well of the 96-well microlfter plate cont«i«- 

Then beads were treated with 0,1 M NaOH a* 

nnn ,o„o„«. av one wash w«. BO ™m1 oZ7^'^"""' '°' ' 
. m.n ,o-,o„e. ^ one wash 50 I wHa. """""'^ 
Four dideoxy termination reactions were carried ou, in aeoarat. 
16 we„s o, ti,e microliter piate. A .ota, o, 84 reactions ,2, prir^r 
reactions/primer, can be pertom»d in a sinflie microliter piar To ea H 
we.l containing immo>.,i.ed sin^e-stranded tempiate. a tottoir:; o 
>A reaction mi«ure was added including ,x reaction buffer ,0 Imol 1 
~- P^"- -60 mM o, dNTPs. 35 mM of one o, t^e dd^^"' 
20 and1^2unitsofT1,emK.se,uenase(Amersham) Seouen„i„ 

wa™ camed out on . .hem.a> cvCer usin, non Jvcl^:::,? "T"' 
;n..n. 50.C. , min. 50-0 to ..-c. rampin, O. I^! rand ^^.^ ^1' 

0.05 M ammoraum citrate. Sequencing products were then . 

r r """" '"'^ ^ - "'^'^^z . 

m-n. TIH. supernatant was used for MAU5I.TOF MS analysis 

Matilx was prepared as described in Kter. e. al ,K,er. H. «», 
^"-^^laimji: i,23-„28„996„. Ws saturated matil 
solution was ti,en d,u.ed ,.52 times witi, pure water before usT o 3^ 
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of the diluted matrix solution was then diluted 1.52 times with pure 
water before use. 0.3 //I of the diluted matrix solution was loaded onto 
the sample target and allowed to crystallize followed by addition of 0 3 ^ 
of the aqueous analyte. A Perseptive Voyager DE mass spectrometer 
5 was used for the experiments, and the samples were typically analyzed 
in the manual mode. The target and middle plate were kept at + 1 8.2 kV 
for 200 nanoseconds after each laser shot and then tiie garget voltage 
was raised to + 20kV, the ion guide wire in the flight tube was kept at - 
2V. Normally, 250 laser shots were accumulated foe each sample. -The 
10 original spectrum was acquired under 500 MHz digitizing rate, and the 
final spectrum was smootiied by a 455 point average (Savitsky and 
Golay, (1964) Analytical Chemistry, 36:1627). Default calibration of the 
mass specti^ometer was used to identify each peak and assign 
sequences. The theoretical mass values of two sequencing peaks were 
15 used to recalibrate each spectrum. (D.P. Uttie, T.J. Cornish, M.J. 
O'Donnel, A. Braun, R.J. Cotter, H. Kter, Anal. Chem., submitted) 
RESULTS 

Alterations of the p53 gene are considered to be a critical step in 
the development of many human cancers (Greenblatt, et al., (1 994) 

20 Cancer Res. 54, 4855-4878; C.C. Harris, (1996) J. Cancer! 73, 261- 
269; and D. SIdransky and M. Hollstein, (1996) Annu.Res.Med., 47,285- 
301). Mutations may serve as molecular indicators of clonality or as 
eariy markers of relapse In a patient witii a previously Identified mutation 
•n a primary tumor (Hainaut, et al., (1 997) Nucleic Acid Res., 25, 1 51- 

25 1 57). The prognosis of tiie cancer may differ according to ti,e nature of 
the p53 mutations present (M.S. Goh et al., (1995) Cancer Res, 55, 
5217-5221). Since tiie discovery of the p53 gene, more than 6000 
different mutations have been detected. Exons 5-8 were selected as 
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sequencing targets where most of the mutations cluster (Hainaut et al 
(1997) Nucleic Acids Res., 25, 151-7). ~^ 
Figure 96 schematically depicts the single tube process for target 
amplification and sequencing, which was performed, as described In 
5 detail in the Materials and Methods. Each of axon 5-8 of the p53 gene 
was PCR amplified using flanking primers in the Intron region; the down 
stream primer was biotinylated. Amplifications of different exons were 
optimized to use the same cycling profile, and the products were used 
without further purification. PCR reactions were performed in a 96 well 
10 microliter plate and the product generated in one well was used as the 
template for one sequencing reaction. Streptavidin-coated magnetic 
beads were added to the same microliter plate and amplified products 
were immobilized. The beads were then treated with NaOH to generate 
immobilized single-stranded DNA as sequencing template. The beads 
15 were washed extensively with Tris buffer since remaining base would 
reduce the activity of sequencing enzyme. 

A total of 21 primers were selected to sequence exon 5-8 of the 
P53 gene by primer walking. The 3'-end nucleotide of all the primers is 
located at the site where no known mutation exists. Four termination 
20 reactions were performed separately which resulted In a total of 84 
sequencing reactions on the same PCR microfiter plate. Non-cycling 
conditions were adopted for sequencing since streptavidin coated beads 
do not tolerate the repeated application of high temperature. Sequencing 
reactions were designed so that mt temiinated fragments were under 70 
nucleotides, a size range easily accessible by MALDI-TOF MS and yet 
long enough to sequence through the next primer binding site 
THermequenase was the enzyme of choice since It could reproducible 
generate a high yield of sequencing products in the desired mass range 
After the sequencing reactions, the b ads were washed with ammonium 
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ion buffers to replace all other cations. The sequencing ladders were 
then removed from the beads by heating In ammonium hydroxide solution 
or simply in water. 

A sub-microliter aliquot of each of the 84 sequencing reactions 
5 was loaded onto one MS sample holder containing preloaded matrix. 
Rgure 94 gives an example of sequencing data generated from one 
primer; four spectra are superimposed. 

All sequencing peaks were well resolved in the mass range needed 
to read through the next sequencing primer site. Sometimes doubly 
10 charged peaks were observed which could be easily Identified by 
correlating tiie mass to tiiat of the singly charged ion. False stops 
generated by eariy tennination of the enzymatic extension can be 
observed cle to the primer site. Since tiie mass resolution is high 
enough, it is easy to differentiate the false stop peaks from the real 
1 5 sequencing peaks by calculating the mass difference of the neighboring 
peaks and crs comparing tiie four spectra. Additionally, mt primers 
generated detectable data tiirough the region of tiie downstream primer 
binding site tiiereby covering tiie false stop region. 

Using optimized procedures of amplification, sequencing, and 
20 conditioning, exons 5-8 of tiie p53 gene were successfully sequenced. 
Correct wildtype sequence data were obtained from all exons witii a 
mass resolution about 300 to 800 over tiie entire mass range. The 
overall mass accuracy is 0.05% or better. The average amount of each - 
sequencing fragment loaded on the MS sample holder is estimated to be 
25 50 fmol or less. 

This example demonstrates tiie feasibility of sequencing exons of a 
human gene by MALDI-TOF MS. Compare to gel-based automated 
fluorescent DNA sequencing, tiie read lengths are shorter. Microchip 
technology can be incorporated to provide for parallel processing. 
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Sequencing products generated in the microtiter plate can be directly 
transferred to a microchip which serves as a launching pad for MALDI- 
TOF MS analysis. Robot-drlven serial and parallel nanoliter dispensing 
tools are being used to produce 100-1000 element DNA arrays on < 1 - 
square chips with flat or geometrically altered with wells) surfaces 

for rapid mass spectrometric analysis. 

Figure 94 shows an MS spectrum obtained on a chip where the 
sample was transfen-ed from a microtiter plate by a pintool. The 
estimated amount of each termination product loaded is 5 fmol or less 
which is in the range of amounts used In conventional Sanger sequencing 
with radiolabeled or fluorescent detection (0.5-1 fmol per fragment). The 
low volume MALDI sample deposition has the advantages of 
miniaturization (reduced reagent cts), enhanced reproducibility and 
automated signal acquisition. 

EXAMPLE 26 

Introduction 

Typically, matrix-associated laser desorption/ionization (Karas. et. 

20 al., (1989) //ir. J. Mass Spectrom. ion Processes. 92, 231) time-of-flight 
mass spectrometry (MALDI-TOF MS) of DNA molecules which are double 
stranded (ds) in solution yields molecular ions representative of the two 

single stranded components (Tang. eta/. (1994) Rapid n»nnr k.^^^ 

SESctr2i!Lfi:183; Tang, eta/. (1995) Nucleic Anirf^. r^, 2I ?1?fi; 

25 Benner, et a/. (1995) Rapid Common Mas.. Rn^^t 9,537. Liu, ef a/. 

(1995) AnaLChem, £2:3482; Siegert eta/. (1996) Anal. Biorh^o, 
2^:55; and Doktycz, eta/, (1995) AnaL_Biochem,23Q: 205); this has 
been observed in several reports dealing with biologically generated DNA 
from a polymerase chain reaction (PGR) amplification (Tang, eta/. (I994) 
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Rapid Commun. Mass Spectrom 8:183; Uu, etal. (1995) Anal. Ch^»^ 
62:3482; Siegert etal. (1996) Anal. Biochem. 243:55; and Doktycz et 
a/. (1995) Anal. Bfochem. 230:205). It is not clear whether the double 
strand is destabilized because of the decreased pH in the matrix 
5 environment or because of absorbance by the duplex during 

desorption/ionization/acceleration of an energy sufficient to overcome the 
attractive van der Waals and "stacking" stabilization forces (Cantor and 

Biophysical Chemistry Part l ; The rnnf^r^ ^^;^^ „^ 
Biamolesules, W.H. Freeman, New York, (1980), 176). When analyte is 
10 present at high concentrations fomiation of non-specific gas-phase ONA 
multimers is, as with proteins (Karas, et. al., (1989) Int. J. M«.< 
■ Spectrom, Ion Proresse. 32:231), common; however, Lecchi and Pannell 
(Lecchi etal. (1995) J. Am. Soc. Mass f^pertrom . 6:972) have provided 
strong evidence for specific Watson Crick (WC) base pairing being 
15 maintained in the gas phase. They detected these specific dimers when 
using 6-aza-2-thiothymlne as a matrix, but did not observe them with 3 
hydroxyplcclinic acid (3-HPA) or 2,4,6- hydroxyacetophenone matrix. As 
described below, by using a low acceleration voltage of the ions and 
preparing samples for MALDI analysis at reduced temperatures, routine 
20 detection of dsONA is possible. 
MATERIALS AND METHODS 

Synthetic DNA. Oligonucleotides were synthesized (Sinha, et al 
(1984) Nucleic Acids Res.. 12, 4539) on a Perspective Expedite DNA 
_ synthesizer and reverse phase HPLCpurified in-house. Sequences were- 
25 50-mer (1 5337 Da): 5'-TTG CGT ACA CAC TGG CCG TCG TTT TAC 
AAC GTC GTG ACT GGG AAA ACC CT-3' (SEQ ID NO. 109); 27-mer 
(complementary, 8343 Da): 5'-GTA AAA CGA CGG CCA GTG TGT ACG 
CAA-3' (SEQ ID NO. 110); 27-mer^ (non-complementary, 8293 Da)- 5'- 
TAC TGG AAG GCG ATC TCA GCA ATC AGC-3' (SEQ ID NO 111) 
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stock solutions were diluted to 20, 10, 5, and 2.5 using 
18Mohm/cm HjO. each of equimolar solutions of the 50-mer and 
either ay-mer^ or 27-nrier^ were mixed and allowed to anneal at room 
temperature for 10 minutes. 0;5//L of these mixtures were mixed directly 
5 on a sample target with 1 //L matrix (0.7 M 3-HPA, 0.07 M ammonium 
citrate in 50% acetonitrile) and allowed to air dry. 
Biological DNA. Enzymatic digestion of human genomic DNA from 
leukocytes was performed. PCR primers (forward, 5'-GGC ACQ GCT 
GTC CAA GGA G-3' (SEQ ID NO. 112)); reverse, 5'-AGG CCG CGC TCG 
10 GCG CCC TC-3' (SEQ ID NO. 1 1 3) to amplify a portion of exon 4 of the 
apoiipoprotein E gene were delineated from the published sequence (Das 
et ai., (1985) J. Biol. Chem., 260 6240). Taq polymerase and lOx 
buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs 
from Phannacia (Freiburg, Germany). The total reaction volume was 50 
15 fA including 20 pmol of each primer and 10% DMSO (dimethylsulf oxide, 
Sigma) with approximately 200 ng of genomic DNA used as template. 
Solutions were heated to SO^C before the addition of lU polymerase; 
PCR conditions were: 2 min at 94«*C, followed by 40 cycles of 30 sec at 
94«>C, 45 sec at 63«C, 30 sec at 72«»C, and a final extension time of 2 
20 min at 72*»C. While no quantitative data was collected to determine the 
final yield of amplified product, it is estimated that -2pmol were available 
for the enzymatic digestion. 

Cfol and Rsal and reaction buffer L were purchased from 
Boehringer-Mannheim. 20//I of amplified products were diluted with 1 5//I 
25 water and 4//I buffer L; after addition of 10 units of restriction enzymes 
the samples were incubated for 60 min at 37''C. For precipitation of 
digest products 5//I of 3M ammonium acetate (pH 6,5), (5//I glycogen 
(Braun, et al. (1997) Clin. Chem. 43:1 151) (lOmg/ml. Sigma), and 110>t/l 
absolute ethanol were added to 50//L of the analyte solutions and stored 
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for 1 hour at room temperature. After at 10 min centrifugation at 
13,000 X g, the pellet was washed in 70o^ ethanol and resuspended in 
1 8Mohm/cm H^O. M«naea m 

Samp/e preparation and analysis by I^ALni-TOF MS Q 35^ of 
5 resuspended DNA was mixed with 0.35-1 .3 matrix solution (0 7M 3 
hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1 • 1 
H,0:CH3CN) (Wu, et al. (1993) Rapid Commun .MassSagsaoa. 7- 1 42) 
on a stainless steel sample target disk and allowed to air dry preceding 
spectrum acquisition using a Thermo Bioanalysis Vision 2000 MALDI 
10 TOF instrument operated in pitive ion reflectron mode with 5 and 20 kV 
on the target and conversion dynode, respectively. Theoretical average 
molecular masses (M,(calc)) of the fragments were calculated from 
atomic compositions; the mass of a proton (I.O8 Da) was subtracted 
from raw data values in reporting experimental molecular masses 
5 (H(exp)) as neutral basis. External calibration generated from eight 
peaks (2000-18000 Da) was used for all spectra. 
Results and Discussion 

Hgure 96A is a MALDI-TOF mass spectrum of a mixture of the 
^ synthetic 50-mer with (non-complementary) 27-mer^ (each 10^, the 
highest final concentration used in this study); the laser power was 
adjusted to iust above the threshold irradiation for ioni2ation. The peaks 
at 8,30 and 15.34 kDa represent singly charged ions derived from the 
27- and 50-mer single strands, respectively. Pooriy resolved low 
mtensity signals at -16.6 and -30.7 kDa represent homodimers of 27- 
and 50.mer, respectively; that at 23.6 kDa is consistent with a 
heterodimer containing one 27-mer and one 50-mer strand. Thus low 
intensity dimer ions representing all possible combinations from the two 
non-complementary oligonucleotides (27 + 27; 27 + 50; 50 -f- 50) were 
Observed. Increasing the irradiance even to a point where depurination 
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peaks dominated the spectrum resulted in slightly higher intensities of 
these dimer peaks. Note tiiat the hybridization was performed at room 
temperature and witfi a very low salt concentration, conditions at which 
non-specific hybridization may occur. 
5 Figure 96 shows a MALDI-TOF spectrum of the same 50-mer 

mixed witii (complementary) 27-merc; the final concentration of each 
oligonucleotide was again 10//M. Using tiie same laser power as in 
Figure 96A, intense signals were again observed at 88.34 and 15.34 
Kda, consistent with single sti^anded 27- and 50-mer, respectively. 
10 Homodimer peaks (27-h27; 50 + 50) were barely apparent In tfie noise; 
however, singly (23.68 Kda) and doubly (1 1 .84k Da) charged 
heterodimer (27 + 50) peaks were dominant. Altiiough the 23.68 Kda 
dimer peak could be detected from all in-adiated positions, its intensity 
relative to the monomer peaks varied slightiy from spot-to-spot. 
1 5 Repeating the experiment witii individual oligonucleotide concentrations 
of 5, 2.5, and 1.25 /rfVI resulted in decreasing amounts of the 27-/50-mer 
Watson-Crick dimer peak relative to tiie 27- and 50-mer single sti^anded 
peaks. At tiie lowest concenti^ations, tiie observation of dimer was 
"crystal-dependent-, tiiat is, irradiation of some crystals produced 
20 significant 27./50-mer dimer signal, while otfier crystals reproducibly 

yielded very little or none. This indicates tiiat tiie incorporation of dsdna 
into tiie matrix crystals or the effectiveness of retaining tills interaction 
through tiie ionization/desorption process is dependent upon tfie 
microscopic properties of tiie crystals, and/or tiiat tfiere exist steep 
25 concenti^ation gradients of tiie duplex throughout tfie sample. 

Thus the Figure 96 specti^a provide sti-ong evidence tfiat specific 
WC base paired dsdna can be observed using gentie laser conditions witii 
f»igh concentrations of oligonucleotides in this mass range, tfie first 
r port of tfiis using a 3-HPA mati^ix. The study was extended to a 
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complex mixture of dsdna derived from an enzymatic digest (Rsal/Cfol) of 
a region of exon 4 of the apolipoprotein E gene (Das et. al., (1985) J. 
Biol. Chem., 260 6240); expected fragment masses are given in 
Table IX. 

Table IX 

Cfol/Rsal Digestion Products from ApoE gene exon 4* 
bases'* 




0 V3 allele has no 1 7/1 9 or 1 9/1 9 pairs; ^ 4 allele contains no 36/38 pair. 
( + > sense strand. (-) antisense strand 

After the digestion step, the samples were purified and concentrated by 
ethanol precipitation and resuspended in 1//L H^O before mixing them at 
room temperature with matrix on the sample target. Neariy 20 peaks 

5 ranging in mass from 3.4-1 7.2 Kda were resolved in the products' 
MALDI spectrum (Figure 97A), all consistent with denatured single 
stranded components of the double strand (Table IX). Many such 
analyses of similar biological products over a period of months also 
yielded spectra with negfigible dsdna, consistent with previous reports 

' (Tang, et al. ( 1 994) Raoid Commun. Mass <^n^^tr^», S: 1 83; Uu. et al. 
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(1995) Anal. Chem. 67:3482; Siegert et al. (1996) Anal. Binoh^r» 
2M:55; and Doktycz, etal. (1995) AoaLBiociTgnL 230:205); contrarily, 
intact double strands were observed under similar conditions for the 
synthetic DNA (Figure 96A). It is difficult to estimate the strand 
5 concentration available after the biological reactions, but presumably that 
it was far lower than that at which dimerization of synthetic samples 
occurred. Furthermore, maintaining specific hybrids within the two- 
component synthetic mixture may be kinetically favored relative to the 
far more complex mixture of 20 single-stranded DNA components from 
10 thedigest. 

The effect of reduced temperature on maintaining dsDNA was 
tested. An aliquot of the digested DNA solution, the matrix, pipette, 
pipette tips, and the stainless steel sample target were stored in a 4«>C 
-cold room- for 15 minutes; as with normal preparations matrix, and 
15 then analyte, were spotted on the target and allowed to co-crystallize 
while air drying. Crystallization for mixtures of 300 nL 3HPA (50% 
acetonitrile) with 300 nL analyte required ~1 minute at room 
temperature but ~ 15 minutes at the reduced temperature. Sample spots 
prepared in the cold room environment typically contained a high 
20 proportion of large transparent crystals. 

MALDI-TOF analysis of an ApoE digest aliquot pi^epared at reduced 
temperature produced the Rgure 97B spectrum. While the low mass 
range appeared qualitatively similar to Rgure 97A, dramatic differences 
above 8 kDa were observed. Only signals consistent with single strands 
25 (Table IX) were observed in Rgure 97A, but the Figure 97B cold room 

prepared samples did not yield signals for the same masses except below 
8 kDa. Even more striking were the additional high mass peaks in Rgure 
97B; clearly these represent dimer peaks containing lower mass 
components. As was done with the synth tic DNA, it was important to 
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determlne whether these represent non-specific heterodimers, specific 
WC heterodimers, or nonspecific homodimers. Consider first the 33 35 
kDa fragment. Ignoring the unlikely possibility that the high mass 
fragment represents a trimer or higher multimer, as a dimer it must onlv 
5 contain the highest mass ssDNA components, Lfi. the > 1 6 kDa 

Homodimerizationof the 15.24 and 1 7. 1 8 kDa fragments would result in 
32.49 an, 34.35 kDa peaks, respectively; corresponding mass err^: f ; 
these .ncorrect assignments relative to the observed 33.35 kDa would be 
-2.6% and -h3.0% respectively. A far better match is achieved if this 
10 peak originates from a heterodimer of the two highest mass single 

stranded fragments; their summed mass (16.24+17,18 = 33 42 kDa) 
differed by 0.2% from the observed dimer mass 33.35 kDa an 

acceptable mass error for MALDI-TOF analysis of large DNA fragments 
usmg external calibration. Ukewise, the 29.66kDa fragment was 
15 measured only 0.13% lower than the 29.70 Da expected for a 

heterodimer of 48-mers; the sum of no other possible homodimers or 
heterodimers were within a reasonable range of this mass. Similar 
arguments could be made for the 22.89 and 1 8.83 kDa fragments 
representing 36-/38-mer and 31-/29-heterodimers. respectively; the 
Signal at 14.86 kDa Is consistent with singly charged single stranded and 
doubly charged double-stranded 48-mer. The agreement of the Figure 
97B masses above 15 kDa with the of dsDNA expected from this digest 
and the absence of homodimers and non-specific heterodimers at random 
masses indicated that the base pairings were indeed highly specific and 
25 provided further evidence that gas-phase WC interactions may be 
retamed in MALDI-generated ions. 

Rgure 98 shows a MALDI-TOF spectrum of an ^4 allele, which 
unhke the e3, was expected to yield no 36-/38-mer pair upon Cfol/Rsll 
digestion. The ^3 and e4 mass spectra were similar except that 
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abundant 22.89 kDa fragment in Figure 97B was not present in Rgure 
98; with this information alone (Table IX) f 3 and eA- alleles were easily 
distinguished, thereby demonstrating the genotyping by direct 
measurement of dsDNA by MALDI-TOF MS. Similarly dsDNA could be 
5 ionized, transferred to the gas phase, and detected by MALDI-TOF MS 
The acceleration voltage typically employed on our instalment was only 
5kV corresponding to 1.5kV/mm up to -2 mm from the sample target, 
with the electric field strength decreasing rapidly with distance from tiie 
sample target. Most previous work used at least 20kV acceleration 
10 (Lecchi et at. (1995) J. Am. Soc. Mass SnAntr»», g.972,. one 

exception a 27-mer dsDNA was detected using a frozen matrix solution 
and 100 V acceleration (Nelson, et aL (1990) Rapid Cnmcr...» t.^^^. 
SeecttoOL 4:348). Without being bound by any theory MALDI-induced 
"denaturation- of dsDNA may be due to gas-phase collisional activation 
15 that disrupts the WC pairing when high acceleration fields are employed, 
analogous to the denaturation presumed to be a first step in the 
fragmentation used for sequencing the single stranded components of 
dsDNA using electrospray ionization (McLafferty et al. (1996) Int. J. 
Mqss Speotrom., Ion Processes) . It appears that the high salt 
20 concentrations (typically > 1 0mM NaCI or KCI) required to stabilize WC 
paired dsDNA in solution are unsuitable for MALDI analysis (Nordhoff et 
BL (1993) Nucleic Acids Res 21:3347); reducing the concentration of 
such non-volatile cations is necessary to avoid cation-adducted MALDI 
signals, but destabilizes the double strands in solution. The low pH 
25 conditions of the matrix environment should also destabilize the duplex. 
As shown in Rgures 97B and 98, storing and preparing even low 
concentrations of the biological samples at reduced temperature at least 
in part offset these denaturing effects, especially for longer strands 
where melting temperatures are higher due to a more ext nsive hydrogen 
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bonding network. The conditions used here are recognized to be very 
non-stringent annealing conditions. 

The low mass tails on high mass dsDNA peaks (g^.. Figure 97B 
232 kDa) are consistent with depurination generated to a higher extent 
5 than the sum of depurination from each of the single strands combined 
Although depurination in solution is an acid-catalyzed reaction, the 
weakly acidic conditions in the 3-HPA matrix do not induce significant 
depurination; molecular ion signals from a mixed-base 5Q-mer measured 
with De-MALDI-TOF had only minor contributions from depurination 
10 peaks (Juhaz,e../. (1996) AoaLChem^ Depurination from the 

smgle stranded components of the gas-phase dsDNA is observed even 
though these bases are expected to be hydrogen bonded to the 
complementary base of the accompanying strand, implying that covalent 
bonds are being broken before the strand is denatured. 

EXAMPLE 27 

Efficiency and Speaficity Assay for Base-Specific Ribonucleases 

Aliquots sampled at regular time intervals during digestion of 
selected synthetic 20 to 25 mers were analyzed by mass spectrometry 
Three of the RNAses were found to be efficient and specific TTiese 
20 include: the G-speclfic T„ the A-specific U, and the A/U-specific PhyM 
The nbonucleases presumed to be C-specific were foQnd to be less 
reliable, did not cleave at every C or also cleaved at U in an 
unpredictable manner. The three promising RNAses all yielded cleavage 
at all of the predicted positions and a complete sequence coverage was 
25 Obtained. addition, the presence of cleavage products containing one 
or several uncleaved positions (short incubation times), allowed 
alignment of the cleavage products. An example of the MALDI-spectrum 
of an aliquot sampled after T, digest of a synthetic 20-mer [SEQ ID 
NO:l 141 RNA is shown in Rgure 100. 
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EXAMPLE 28 

Immobilization of amplified DNA targets to silicon wafers 
Silicon surface preparation 

Silicon wafers were washed with ethanol, flamed over bunsen 
5 burner, and immersed in an anhydrous solution of 25% (by volume) 3- 
aminopropyltriethoxysilane in toluene for 3 hours. The silane solution 
was then removed, and the wafers were washed three times with 
toluene and three times with dimethyl sulfoxide (DMSO): The wafers 
were then incubated in a lOmM anhydrous solution of N-succinimidyl (4- 

10 iodoacetyl) aminobenzoate (SIAB) (Pierce Chemical, Rockford, IL) in 
anhydrous DMSO. Following the reaction, the SIAB solution was 
removed, and the wafers were washed three times with OMSO. In all 
cases, the iodoacetamido-functionalized wafers were used immediately to 
minimize hydrolysis of the labile iodoacetaniido-functionality. 

1 5 Additionally, all further wafer manipulations were performed in the dark 
since the iodoacetamido-functionality is light sensitive. 

immobilization of amplified thioi-containing nucleic acids 
The SIAB-conjugated silicon wafers were used to analyze specific 
free thiol-containing DNA fragments of a particular amplified DNA target 

20 sequence. A 23'mer oligodeoxynucleotide containing a 5 '-disulfide 

linkage (purchased from Operon Technologies; SEQ ID NO: 117] that is 
complementary to the 3'-region of a 1 1 2 bp human genomic DNA 
template [Genebank Acc. No.: Z52259; SEQ ID NO: 118] was used as a 
primer in conjunction with a commercially available 49-mer primer, which 

25 is complementary to a portion of the 5'-end of the genomic DNA 
(purchased from Operon Technologies; SEQ ID NO: 119], in PGR 
reactions to amplify a 1 35 bp DNA product containing a 5'*disulfide 
linkage attached to only one strand of the DNA duplex [SEQ ID NO: 
1201. 
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The PGR amplification reactions were performed using the 
Amplitaq GoldKit [Perkin Elmer Catalog No. N808-0249J. Briefly 200 n 
1 12 bp human genomic DNA template was incubated with 10^ of 23-^ 
mer primer and 8pM of commercially available 49-mer primer 1 0 mM 
5 dNTPs, 1 unit of Amplitaq Gold DNA polymerase in the buffer provided 
by the manufacturer and PGR was performed in a thermocycler. 

The 5'-disulfide bond of the resulting amplified product was fully 
reduced using 10 mM tris-(2-carboxyethyl) phosphine (TCEP) (Pierce 
Chemical, Rockford, IL) to generate a free S'-thiol group. Disulfide 
10 reduction of the modified oligonucleotide was monitored by observing a 
shift ,n retention time on reverse-phase FPLC. It was determined that 
after five hours in the presence of 10 mM TCEP, ti,e disulfide was fully 
reduced to a free tiilol. Immediately following disulfide cleavage the 
modified oligonucleotide was incubated witfi tiie iodacetamido- ' 
15 functionalized wafers and conjugated to the surface of ti,e silicon wafer 
through ti^e SIAB linker. To ensure complete ti,iol deprotonation the 
coupling reaction was performed at pH 8.0. Using lOmM TCEP to cleave 
tiie disulfide and ti,e other reaction conditions described above it was 
possible to reproducibly yield a surface density of 250 fmol per square 
20 mm of surface. 

Hybridization and MALDI-TOF Mass spectrometry 
The silicon wafer conjugated witii tiie 135 bp tiiiol-containing DNA 
was mcubated witii a complementary 12.mer oligonucleotide [SEQ ID 
NO: 1211 and specifically hybridized DNA fragments were detected using 
25 MALDI-TOF MS analysis. The mass spectrum revealed a signal witii an 
observed experimental mass-to-charge ratio of 3618.33; the theoretical 
mass-to-charge ratio of tiie 12-mer oligomer sequence is 3622.4 Da. 

Thus, specific DNA target molecule tfiat contain a 5'-disulfide 
linkage can be amplified. The molecules are immobilized at a high 
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denslty on a SIAB-derivatized silicon wafer using the methods described 
herein and specific complementary oligonucleotides may be hybridized to 
these target molecules and detected using MALDI-TOF MS analysis. 

EXAMPLE 29 

^ A^ayJ "'^^ "^^"^ NudelcAdd Immobilization to Generate Nucleic Add 

Employing the high density attachment procedure described in 
EXAMPLE 28, an -array of DNA oligomers amenable to MALDI-TOF mass 
spectrometry analysis was created on a silicon wafer having a plurality of 
10 locations, e^ depressions or patches, on Its surface. To generate the 
array, a free thiol-containing oligonucleotide primer was immobilized only 
at the selected locations of the wafer tg^ see EXAMPLE 28]. The each 
location of the array contained one of three different oligomers. To 
demonstrate that the different immobilized oligomers could be separately 
detected and distinguished, three distinct oligonucleotides of differing 
lengths that are complementary to one of the three oligomers were 
hybridized to the an^ay on the wafer and analyzed by MALDI-TOF mass 
spectrometry. 

Oligodeoxynudeotides 

Three sets of complementary oligodeoxynucleotide pairs were 
synthesized in which one member of the complementary oligonucleotide 
pair contains a 3'- or 5'-disulfide linkage [purchased from Operon 
Technologies or Oligos, Etc.J. For example. Oligomer 1 
Id(CTGATGCGTCGGATCATCTTTTTT-SS); SEC ID NO: 1221 contains a 
25 3'-disulfide linkage whereas Oligomer 2 (d(SS- 

CCTCTTGGGAACTGTGTAGTATT); a 5'-disulfide derivative of SEQ ID 
NO: 1171 and Oligomer 3 Id(SS-GAATTCGAGCTCGGTACCCGG); a 5'- 
disulfide derivative of SEQ ID NO: 1 1 5] each contain a 5'-disulfide 
linkag . 
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The oligonucleotides complementary to Oligomers 1-3 were 
designed to be of different lengths that are easily resolvable from one 
another during MALDf-TOF MS analysis. For example, a 23-mer 
oligonucleotide fSEQ ID NO: 1231 was synthesized complementary to a 
5 portion of Oligomer 1, a 12-mer oligonucleotide (SEQ ID NO: 121 J was 
synthesized complementary to a portion of Oligomer 2 and a 21-mer 
[SEQ ID NO: 1 16J was synthesized complementary to a portion of 
Oligomer 3. In addition, a fourth 29-mer oligonucleotide, f SEQ ID NO- 
124] was synthesized that lacks complementarity to any of the three 

10 oligomers. This fourth oligonucleotide was used as a negative control. 
Silicon surface chemistry and DNA immobiHzation 
(a) 4x4(1 &4ocation} array 

A 2 X 2 cm' silicon wafer having 256 individual depressions or 
wells in the form of a 16 X 16 well array was purchased from a 
15 commercial supplier (Accelerator Technology Corp., College Station 
Texasj. The wells were 800 X 800 fnry\ 1 20 /m, deep, on a 1.125 
pitch. The silicon wafer was reacted with 3-aminopropyltriethoxysllane 
to produce a uniform layer of primary amines on the surface and then 
exposed to the heterobifunctional crosslinker SIAB resulting in 
20 iodoacetamido functionalities on the surface le^ see EXAMPLE 281. 

To prepare tiie ofigomers for coupling to tiie various locations of 
the silicon anray, the disulfide bond of each oligomer was fully reduced 
using 10 mM TCEP as depicted in EXAMPLE 28. and the DNA 
resuspended at a final concentration of 10 //M in a solution of 100 mM 
25 phosphate buffer, pH 8.0. Immediately following disulfide bond 
reduction, the free-tiilol group of the oligomer was coupled to tiie 
iodoacetamido functionality at 1 6 locations on tiie wafer using the probe 
coupling conditions essentially as described above in EXAMPLE 28 To 
accomplish tiie separate coupling at 16 distinct locations of tiie wafer 



wo 98/20166 



PCT/DS97/20444 



-219- 



the entire surface of the wafer was not flushed with an oligonucleotide 
solution but. Instead, an ~30-nl aliquot of a predetermined modified 
oligomer was added In parallel to each of 16 locations (i.e., depressions) 
of the 256 wells on the wafer to create a 4 x 4 array of immobilized DNA 
5 using a robotic pintool. 

The robotic pintool consists of 16 probes housed in a probe block 
and mounted on an X Y, Z robotic stage. The robotic stage was a gantry 
system which enables the placement of sample trays below the arms of 
the robot. The gantry unit itself is composed of X and Y arms which 
10 move 250 and 400 mm. respectively, guided by brushless linear servo 
motors with positional feedback provided by linear optical encoders. A 
lead screw driven 2 axis (50 mm vertical travel) Is mounted to the xy axis 
slide of the gantiv unit and is controlled by an in-line rotary servo motor 
with positional feedback by a motor-mounted rotary optical encoder. The 
15 work area of tiie system is equipped witii a slide-out tooling plate that 
holds five microtiter plates (most often, 2 plates of wash solution and 3 
plates of sample for a maximum of 1 152 different oligonucleotide 
solutions) and up to ten 20x20 mm wafers. The wafers are placed 
precisely in tiie plate against two banking pins and held secure by 
20 vacuum. The entire system Is enclosed In plexl-glass housing for safety 
and mounted onto a steel support frame for tfiermal and vibrational 
damping. Motion conti-ol is accomplished by employing a commercial 
motion controller which was a 3-axis servo controller and Is integrated to 
a computer; programming code for specific applications is written as 
25 needed. 

To create the DNA an^ay, a pintool with assemblies that have solid 
pin elements was dipped into 16 wells of a multi-well DNA source plate 
containing solutions of Oligomers 1-3 to wet tiie distal ends of th pins, 
the robotic assembly moves the pin assembly to the silicon wafer, and 
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the sample spotted by surface contact. Thus, one of modified Oligomers 
1-3 was covalentiy immobilized to each of 16 separate we«s of the 256 
wells on the silicon wafer thereby creating a 4 x 4 array of immobilized 
DNA. 

5 In carrying out the hybridization reaction, the three complementary 

oligonucleotides and the negative control oligonucleotide were mixed at a 
: final concentration of 10//M for each oligonucleotide in 1 ml of TE buffer 
no mM Tris-HCI, pH 8.0. 1 mM EDTAJ supplemented with 1 M NaCI 
and the solution was heated at eS^C for 10 min. Immediately thereafter 
10 the entire surface of the silicon wafer was flushed with 800 //I of the 
heated oligonucleotide solution. The complementary oligonucleotides 
were annealed to the immobilized oligomers by incubating the silicon 
array at ambient temperature for 1 hr, followed by incubation at 4-C for 
at least 1 0 min. Alternatively, the oligonucleotide solution can be added 
to the wafer which is then heated and allowed to cool for hybridization. 

The hybridized array was then washed with a solution of 50 mM 
ammonium citrate buffer for cation exchange to remove sodium and 
potassium ions on the DNA backbone (Pieles etaL. (1993) fsiucL^ 
Bes^ 11:3191-3196). A 6-nl aliquot of a matrix solution of 3- 
20 hydroxypicoiinic acid 10.7 M 3-hydroxypicoIinic acid-10 % ammonium 
citrate in 50 % acetonitrile; see Wu et al. Rapid Com,^..o k..^, 
Seecaooi. 2: 142-146 (1993)1 was added in series to each location of the 
array using a robotic piezoelectric serial dispenser (Le„ a piezoelectric 
pipette system). 

25 The piezoelectric pipette system is built on a system purchased 

from Microdrop GmbH, Norderstedt Germany and contains a piezoelectric 
element driver which sends a pulsed signal to a piezoelectric element 
bonded to and surrounding a glass capillary which holds the solution to 
be dispensed; a pressure transducer to load (by negative pressure) or 



wo 98/20166 



PCT/US97/20444 



-221- 



empty (by positive pressure) the capillary; a robotic xyz stage and robot 
driver to maneuver the capillary for loading, unloading, dispensing, and 
cleaning, a stroboscope and driver pulsed at the frequency of the piezo 
element to enable viewing of 'suspended' droplet characteristics; 
5 separate stages for source and designation plates or sample targets (i.e. 
Si chip); a camera mounted to the robotic arm to view loading to 
designation plate; and a data station which controls the pressure unit 
xyz robot, and piezoelectric driver. 

The 3-HPA solution was allowed to dry at ambient temperature 
10 and thereafter a 6-nl aliquot of water was added to each location using 
the piezoelectric pipette to resuspend the dried matrix-DNA complex, 
such that upon drying at ambient temperature the matrix-DNA complex 
forms a uniform crystalline surface on the bottom surface of each 
location. 

1 5 MALOI-TOF MS analysis 

The MALDI-TOF MS analysis was performed in series on each of 
the 16 locations of the hybridization array illustrated in Rgure 6 
essentially as described in EXAMPLE 28. The resulting mass spectrum of 
oligonucleotides that specifically hybridized to each of the 16 locations of 
20 the DNA hybridization revealed a specific signal at each location 
representative of observed experimental mass-to-charge ratio 
corresponding to the specific complementary nucleotide sequence. 

For example, in the locations that have only Oligomer 1 conjugated 
thereto, the mass spectrum revealed a predominate signal with an 
25 observed experimental mass-to-charge ratio of 7072.4 approximately 
equal to that of the 23-mer; the theoretical mass-to-charge ratio of the 
23-mer is 7072.6 Da. SimHariy, specific hybridization of the 12-mer 
oligonucleotide to the an-ay, observed experimental mass-to-charge ratio 
of 3618.33 Da (theoretical 3622.4 Oa), was detected only at those 
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locations conjugated with Oligomer 2 whereas specific hybridization of 
MJM6 (observed experimental mass-to-charge ratio of 6415.4) was 
detected only atthose locations of the array conjugated with Oligomer 3 
[theoretical 6407.2 DaJ. 

5 None of the locations of the array revealed a signal that 

corresponds to the negative control 29-mer oligonucleotide (theoretical 
mass-to-charge ratio of 8974.8) indicating that specific target DNA 
molecules can be hybridized to oligomers covalently immobilized to 
specific locations on the surface of the silicon array and a plurality of 
10 hybridization assays may be individually monitored using MALOI-TOF MS 
analysis. 

(b) 8x8 (64-location) array 

A 2 X 2 cm^ silicon wafer having 256 individual depressions 
or wells that fomi a 16 X 16 array of wells was purchased from a 
1 5 commercial supplier {Accelerator Technology Corp., College Station, 
Texas!. The wells were 800 X 800/ml^ 120 /mi deep, on a 1.125 
pitch. The silicon wafer was reacted with 3-aminopropyltrlethoxysilane 
to produce a uniform layer of primary amines on the surface and then 
exposed to the heterobifunctional crosslinker SIAB resulting in 
20 iodoacetamido functionalities on the surface as described above. 

To make an an^ay of 64 elements, a pintool was used following the 
procedures described above. The pintool was dipped into 16 wells of a 
384 well DNA source plate containing solutions of Oligomers 1-3. moved 
to the silicon wafer, and the sample spotted by surface contact. Next 
25 the tool was dipped in washing solution, then dipped into the same 16 
wells of the source plate, and spotted onto the target 2.25mm offset 
from the initial set of 16 spots; the entire cycle was repeated to make a 
2x2 array from each pin to produce an 8x8 array of spots (2x2 
elements/pin X 16 pins = 64 total elements spotted). 



WO9fl/201«6 



PCr/US97/20444 



10 



-223- 

Oligomers 1-3 immobilized to the 64 locations were hybridized to 
complementary oligonucleotides and analyzed by MALDI-TOF MS 
analysis. As observed for the 16-location array, specific hybridization of 
the complementary oligonucleotide to each of the immobilized thiol- 
containing oligomers was observed in each of the locations of the DNA 
array. 

EXAMPLE 30 

Extension of hybridized DNA primers bound to DNA templates 
immobilized on a silicon wafer 

The SIAB-derivatized silicon wafers can also be employed for 
primer extension reactions of the immobilized DNA template using the 
procedures essentially described in EXAMPLE 7. 

A 27-mer oligonucleotide [SEQ ID NO: 125] containing a 3'-free 
thiol group was coupled to a SIA&<lerivatized silicon wafer as described 
15 above, for example, in EXAMPLE 28. A 12-mer oligonucleotide primer 
ISEQ ID NO: 1261 was hybridized to the immobilized oligonucleotide and 
tiie primer was extended using a commercially available kit [e.g. . 
Sequenase or Thermo Sequenase, U.S. Biochemical Corp]. The addition 
of Sequenase DNA polymerase or ThermoSequenase DNA polymerase in 
20 the presence of three deoxyribonucleoside triphosphates (dNTPs; dATP, 
dGTP, dCTP) and dideoxyribonucleoside thymidine triphosphate (ddTTP) 
in buffer according to the instructions provided by the manufacturer 
resulted in a 3-base extension of tiie 12-mer primer while still bound to 
the silicon wafer. The wafer was then analyzed by MALDI-TOF mass 
25 spectrometty as described above. The mass spectrum results cleariy 
distinguish the 15-mer [SEQ ID NO: 1271 from the original unextended 
12-mer thus indicating that spedfic extension can be performed on the 
surface of a silicon wafer and detected using MALDI-TOF MS analysis. 
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EXAMPLE 31 



Effect of linker length on polymerase extension of hybridized DWa 
pnmers bound to DMA templates Immoblfeed on a Xon Xafer 

The effect of the distance between the SIAB-conjugated silicon 

5 surface and the duplex DNA formed by hybridization of the target ONA to 
the .mmobifeed oligomer template was investigated, as well as choice of 
enzyme. 

Two SIAB-derivatized silicon wafers were conjugated to the 3'-end 
of two free thiol-containing oligonucleotides of identical DNA sequence 

10 except for a 3-base poly dT spacer sequence incorporated at the 3'-end- 
CTGATGCGTC GGATCATCTT TTTT SEQ ID No. 1 22 

CTGATGCGTC GGATCATCTT TTTTTTT SEQ ID No. 1 25. 
These oligonucleotides were synthesized and each was separately 
Immobilized to the surface of a silicon wafer through the SIAB cross 

15 linker [e^. see EXAMPLE 28J. Each wafer was incubated with a 12-mer 
oligonucleotide: 

AAAAAAGATG AT SEQ ID No. 126 

GATGATCCGACG SEQ ID No. 128 

GATCCGACGC AT SEQ ID No. 129, 

20. Which is complementary to portions of the nucleotide sequences common 
to botii of the oligonucleotides, by denaturing at 75 "C and slow cooling 
the silicon wafer. The wafers were tiien analyzed by MALDI-TOF mass 
spectrometry as described above. 

As described in EXAMPLE 30 above, a 3.base specific extension 
of tiie bound 1 2-mer oligonucleotide was observed using ti,e oligomer 
pnmer where tiiere is a 9-base spacer between tiie duplex and the 
surface [SEQ ID NO: 125]. Similar results were observed when the DNA 
spacer lengtfis between the SIAB moiety and tiie DNA duplex were O 3 

6 and 12. In addition, the extension reaction may be performed using a 
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(US B.ochem,cal). Thus, the SIAB .Inker may be directly coupled to the 
DNA template or may include a linker sequence without effecting primer 
extension of the hybridized ONA. 

® EXAMPLE 32 

Spectrochip mutant detection in ApoE gene 

This example describes the hybridization of an immobilized 
template, primer extension and mass spectrometry for detection of the 
wddtype and mutant Apolipoprotein E gene for diagnostic purposes 
TNs example demonstrates that immobilized DNA molecules containing a 
specific sequence can be detected and distinguished using pri„,er 
extension of unlabeled allele specific primers and analysis of the 
extension products using mass spectrometry. 

A 50 base synthetic DNA template complementary to the coding 
15 sequence of allele 3 of the wildtype apolipoprotein E gene- 

or complement to the mutant apolipoprotein E gene carrying a G -*A 
transition at codon 158: 

20 6--GCCTGGTACACTGCCAGGCACTTCTGCAGOTCATCa6CATCGCGGAGOAG 3- 
fSEQ 10 NO: 28tl vi«o<=Aa-3 

containing a 3'-free thiol group was coupled to separate SIAB^erivatized 
silicon wafers as described In Example 28. 
A 21-mer oligonucleotide primer: 
!5 GCC GAT GAC CTG CAG AAG-3' (SEQ 10 NO: 282] w» 

hybridi«d ,o each of *e immobriized templates and *e primer was 
extended using a commercially available kit leg.. Sequenase or 
Thermosecuenase. U.S. Biochemical Corpl. The addition „, Sequenase 

0 Oft "' ^^'^ "-^A polymerase in the presence 

of three deoxynbonucleoside triphosphates (dNTPs; dATP. dQTP. dTTP) 



WO<>8AZ01«« 



PCT/US97/20444 



-226- 



10 



and dideoxyribonucleoside cytosine triphosphate (ddCTP) in buffer 
according to the instructions provided by the manufacturer resuited in a 
single base extension of the 21.mer primer bound to the immobilized 
template encoding the wildtype apolipoprotein E gene and a three base 
extension of the 21-mer primer bound to the immobilized template 
encoding the mutant form of apolipoprotein E gene. 

The wafers were analyzed by mass spectrometry as described 
herein. The wildtype apolipoprotein E sequence results in a mass 
spectrum that distinguishes the primer with a single base extension (22- 
mer) with a mass to charge ratio of 6771,17 Da (the theoretical mass to 
charge ratio is 6753.6 Da) from the original 21-mer primer with a mass 
to charge ration of 6499.64 Da. The mutant apolipoprotein E sequence 
results in a mass spectrum that distinguishes the primer with a three 
base extension (24-mer) with a mass to charge ratio of 7386.9 (the 
theoretical mass charge is 7386.9) from the original 21-mer primer with a 
mass to charge ration of 6499.64 Da. 

EXAMPLE 33 

Detection of Double-Stranded Nucleic Acid Molecules via Strand 
SuSel™ Hybridization to an Immobnized CompS.S^^% 

This example describes immobilization of a 24-mer primer and the 
specific hybridization of one strand of a duplex DNA molecule, thereby 
pemiitting amplication of a selected target molecule in solution phase and 
permitting detection of the double stranded molecule. This method is 

25 useful for detecting single base changes, and. particulariy for screening 
genomic libraries of double-stranded fragments. 

A 24-mer DNA primer CTGATGCGTC GGATCATCTT TTTT 
SEQ ID No. 122, containing a 3'-free thiol group was coupled to a SIAB 
-derivatized silicon wafer as described in Example 29. 

30 An 1 8-mer synthetic oligonucleotide : 
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5'-CTGATGCGTCGGATCATC-3' [SEQ ID NO: 286] was 
premixed with a 12-mer 5'-GATGATCCGACG-3' (SEQ ID NO: 285J 
that has a sequence that is complementary to 1 2 base portion of the 1 8- 
mer oligonucleotide. The oligonucleotide mix was heated to 75«>C and 
5 cooled slowly to room temperature to faciliate the formation of a duplex 
molecule: 

5'-CTGATGOGTCX3GATCATC-3' [SEQ ID NO. 286 J 
3'- GCAGCCTAGTAG-5' ISEQ ID NO: 2871, 

The specific hybridization of the 1 2-mer strand of the duplex 
10 molecule to the immobilized 24^mer primer was carried out by mixing 
1//M of the duplex molecule using the hybridization conditions described 
in Example 30. 

The wafers were analyzed by mass spectrometry as described 
above. Specific hybridization was detected in a mass spectrum of the 
15 12-mer with a mass to charge ratio of 3682.78 Da. 

EXAMPLE 34 

1 -(2-Nitro-5-(3-0-4,4'-dImethoxytritylpropoxy)phenyl)- 1 -0-((2- 
cyanoethoxy)-dnsopropyfaminophosphino)ethane 

A. 2-Nitro-5-(3-hydroxypropoxy)benzaldehyde 
20 3-Bromo-1 -propanol (3.34 g, 24 mmol) was refluxed in 80 ml of 

anhydrous acetonitrile with 5-hydroxy-2-nitrobenzaldehyde (3.34 g, 20 
mmol), K2CO3 (3.5 g), and Kl (100 mg) overnight (15 h). The reaction 
mixture was cooled to room temperature and 1 50 ml of methylene 
chloride was added. The mixture was filtered and the solid residue was 
25 washed with methylene chloride. The combined organic solution was 
evaporated to dryness and redissoived in 100 ml methylene chloride. 
The resulted solution was washed with saturated NaCI solution and dried 
over sodium sulfate. 4.31 g (96%) of desired product was obtained after 
removal of the solvent in vacuo. 
30 = 0.33 (dichloromethane/methanol, 95/5). 
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UV (methanol) maximum: 313, 240 (shoulder), 215 nm; minimum: 266 
nm. 

'H NMR {DMSO-d«) 6 10.28 (s, IH), 8.17 (d, 1H). 7.35 (d, 1H), 7 22 (s 
IH), 4.22(t, 2H), 3.54 (t, 2H), 1.90 (m, 2H). 

"C NMR (DMSO-de) 6 189.9, 153.0, 141.6, 134.3, 127.3, 118 4 
114.0, 66.2, 56.9, 31.7. 

B. 2-Nltro-5-(3-0-t-butyldimethylsilylpropoxy}ben2aldehyde 
2-Nitro-5-(3-hydroxypropoxy)ben2aldehyde(1 g, 4.44 mmol) was 
dissolved In 50 ml anhydrous acetonitrile. To this solution, it was added 
1 ml of triethylamine, 200 mg of imidazole, and 0.8 g (5.3 mmol) of 
tBDMSCI. The mixture was stirred at room temperature for 4 h. 
Methanol (1 ml) was added to stop the reaction. The solvent was 
removed in vacuo and the solid residue was redissolved in 100 ml 
methylene chloride. The resulted solution was washed with saturated 
sodium bicarbonate solution and then water. The organic phase was 
dried over sodium sulfate and the solvent was removed in vacuo. The 
crude mixture was subjected to a quick silica gel column with methylene 
chloride to yield 1 .44 g (96%) of 2-nitro-5-(3-0-t- 
butyldimethylsllylpropoxy)ben2aldehyde. 

= 0.67 (hexane/ethyl acetate, 5/1). 
UV (methanol), maximum: 317, 243, 21 5 nm; minimum: 235, 267 nm 
»H NMR (DMSO-de) 6 10.28 (s, 1H), 8.14 (d, 1H), 7.32 (d, 1H), 7 20 (s 
1H), 4.20 (t, 2H), 3.75 (t, 2H), 1.90 (m, 2H), 0.85 (s. 9H), 0.02 (s 6H)' 
'^CNMR (DMSO-de)<y 189.6, 162.7, 141.5, 134.0, 127.1, 1182 
25 113.8, 65.4,58.5,31.2,25.5,-3.1,-5.7. 

C. 1-(2-Nltro-5-(3-0-t-butyldimethylsilylpropoxy)phenyl)ethanoI 
High vacuum dried 2-nitro-5-(3-0-t- 

butyldimethylsilyIpropoxy)benzaldehyde (1.02 g, 3 mmol) was dissolved 
50 ml of anhydrous methylene chloride. 2 M Trimethylaluminium in 
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toluene (3 ml) was added dropwise within 10 min and keeped the 
reaction mixture at room temperature. It was stirred further for 10 min 
and the mixture was poured into 10 ml ice cooled water. The emulsion 
was separated from water phase and dried over 100 g of sodium sulfate 
to remove the remaining water. The solvent was removed In vacuo and 
the mixture was applied to a silica gel column with gradient methanol In 
methylene chloride. 0.94 g (86%) of desired product was isolated. 
Rf -0.375 (hexane/ethyl acetate, 5/1), 

UV (methanol), maximum: 306, 233, 206 nm; minimum: 255, 220 nm 
10 'H NMR (DMSO-d,, S 8.00 (d, 1H), 7.36 (s, IH), 7.00 (d, 1„, 5 49 (b' 

Z'l'll ''^^ 1-37 (d, ' 

3H), 0.86 (s, 9H), 0.04 (s, 6H). 

"C NMR (DMSO-d,) 6 162.6. 146.2, 139.6, 126.9, 112.9, 112 5 64 8 
63.9,58.7,31.5,25.6,24.9,-3.4,-5.8. ^ 2.5, 64.8, 

15 O. 1-<2-Nitro.5-(3-hydroxypropoxy)phenyI)ethanol 

1-<2-NJtro-5-(3-0-t-butyldimethylsilylpropoxy)phenyl)ethanol (0.89 
g, 2.5 mmol) was dissolved in 30 ml of THF and 0.5 mmol of nBu NF 
was added under stirring. The mixture was stirred at room temperature 
for 5 h and the soh^ent was removed in vacuo. The remaining residue 
was appHed to a silica gel column with gradient methanol in methylene 

chlonde. 1-(2-Nitro-5-(3-hydroxypropoxy)phenyl)ethanol (0.6 g (99%) 
was obtained. » » 701 

Rf = 0. 1 7 (dichloromethane/methanol, 95/5) . 

UV (methanol), maximum: 304, 232, 210 nm; minimum: 255, 219 nm 
25 ^" NMR (O^^^^^^^ . 8.00 (d, IH), 7.33 (s, IH). 7.00 (d, IH), 5.50 (^ 

2H:'r.3:(::H"^"""^"-^"^'^"^'^- 
2:zT:T::r' 
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E. 



1-(2-Nitro-5.(3-0.4,4'-dimethoxytrftylpropoxy)ph^ 
1 -(2-Nitro-5-(3-hydroxypropoxy)phenyl)ethanol (0.482 2 mmol) 
was co-evaporated with anhydrous pyridine twice and dissolved in 20 ml 
anhydrous pyridine. The solution was cooled in ice-water bath and 750 
5 mg (2.2 mmol) of DMTCI was added. The reaction mixture was stirred 
at room temperature overnight and 0.5 ml methanol was added to stop 
the reaction. The solvent was removed in vacuo and the residue was co 
evaporated with toluene twice to remove trace of pyridine. The final 
residue was applied to a silica gel column with gradient methanol in 
10 methylene chloride containing drops of triethylamine to yield 0.96 g 
(89%) of the desired product 1-(2-nitro-5-(3-0-4,4'-dimethoxytrityl- 
propoxy)phenyl)ethanol. 
R« = 0.50 (dichloromethane/methanol, 99/1). 

UV (methanol), maximum: 350 (shoulder), 305, 283, 276 (shoulder), 
15 233, 208 nm; minimum: 290, 258, 220 nm. 

NMR (DMSO-de) * 8.00 (d, 1H), 6.82-7.42 (ArH), 5.52 (d, OH) 5 32 
(m, 1H), 4.23 (t, 2H), 3.71 (s, 6H), 3.17 (t, 2H), 2.00 (m, 2H) 1 37 
(d, 3H). 

"C NMR (DMOSKle) 6 162.5, 157.9, 157.7, 146.1, 144 9 140 1 
20 139.7, 135.7, 129.5, 128.8, 127.6, 127.5, 127.3, 126.9,' 126 4' 
113.0, 112.8, 112.6, 85.2, 65.3, 63.9, 59.0, 54.8, 28.9, 24.9. ' 

F. ;,^2-Nitro-5-(3-0-4,4'-dimethoxytrityIpropoxy)phenvl)-1^ 
((2-cyanoethoxy)-diisopropylaminopiosphino^eS^ 

'-<2.Nitro-5-(3-0-4,4'-dimetfioxytritylpropoxy)phenyl)ethanol (400 
mg, 0.74 mmol) was dried under high vacuum and was dissolved in 20 
ml of anhydrous methylene chloride. To this solution, it was added O 5 
ml N,N-diisopropylethylamine and 0.3 ml (1 .34 mmol) of 2-cyanoetfiy| 
N,N-diisopropylchlorophosphoramidite. The reaction mixture was stirred 
at room temperature for 30 min and 0.5 ml of methanol was added to 
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stop the reaction. The mixture was washed with saturated sodium 
bicarbonate solution and was dried over sodium sulfate. The solvent 
was removed in vacuo and a quick silica gel column with 1% methanol in 
methylene chloride contalninq drops of triethylamine yield 510 mg (93%) 
5 the desired phosphoramidite. 

R,=0-87 (dichloromethane/methanol, 99/1). 

EXAMPLE 35 

1-{4-{3-0-4,4'-0lmethoxytrityIpropoxy)-3-methoxy-6-nitrophenyl)-1 -0-«2 
cyanoethoxy)-diisopropylaminoph6sphino)ethane 

A- 4-(3-Hydroxypropoxy)-3-methoxyacetophenone 
3-Bromo-1-propanol (53 ml, 33 mmol) was refluxed in 100 ml of 
anhydrous acetonitrile with 4-hydroxy-3-methoxyacetophenone (5 g, 30 
mmol), K2CO3 (5 g), and Kl (300 mg) overnight (15 h). 
Methylenechloride (150 ml) was added to the reaction mixture after 
15 cooling to room temperature. The mixture was filtered and the solid 
residue was washed with methylene chloride. The combined organic 
solution was evaporated to dryness and redissolved in 100 ml methylene 
chloride. The resulted solution was washed with saturated NaCI solution 
and dried over sodium sulfate. 6.5 g (96.4%) of desired product was 
20 obtained after removal of the solvent in vacuo. 
R,=0.41 (dichloromethane/methanol, 95/5). 

UV (methanol), maximum: 304, 273, 227, 210 nm: minimum: 291, 244, 
214 nm. 

'H NMR (DMSO-de) ^ 7.64 (d, 1H), 7.46 (s, 1H), 7.04 (d, 1H), 4.58 (b, 
25 OH), 4.1 2 (t, 2H), 3.80 (s, 3H), 3.56 (t, 2H), 2.54 (s, 3H), 1 .88 (m, 2H). 
"C NMR (DMSO-de) <y 196.3, 152.5, 148.6, 129.7, 123.1. 111.5, 
110.3, 65.4, 57.2, 55.5, 31.9, 26.3. 

B. 4-(3-Acetoxypropoxy)-3-methoxyacetophenone 
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25 



4-(3-.Hydroxypropoxy)-3-methoxyacetophenone (3.5 g, 15.6 mmol) 
was dried and dissolved in 80 ml anhydrous acetonitrile. This mixture 6 
ml of triethylamlne and 6 ml of acetic anhydride were added After 4 h 6 
ml methanol was added and the solvent was removed in vacuo The ' 
residue was dissolved in 100 ml dichloromethane and the solution was 
washed with dilute sodium bicarbonate solution, tiien water. The 
organic phase was dried over sodium sulfate and the solveni was 
removed. The solid residue was applied to a silica gel column with 
metiiylene chloride to yield 4.1g of 4-(3-acetoxypropoxy)-3- 
10 methoxyacetophenone (98.6%). 

Rf = 0.22 (dichloromethane/methanol, 99/1). 

217^'^^"*'"' "'^'"''""'"^ 21°""'' minimum: 290, 243, 

'H NMR (DMSO-de) S 7.62 (d. IH), 7.45 (s, IH), 7,08 (d, IH) 4 12 (m 
15 3.82 (s, 3H), 2.54 (s, 3H), 2.04 (m, 2H), 2.00 (s, 3H) 

"C NMR (DMSO-de) 6 196.3, 170.4, 152.2, 148.6, 130.0, 123 O 
1 1 1.8, 1 10.4, 65.2, 60.8, 55.5, 27.9, 26.3, 20.7. 

C. 4-(3-Acetoxypropoxy).3-methoxy-6-nitroacetophenone 
4-(3-Acetoxypropoxy)-3-metiioxyacetophenone (3.99 g, 15 mmol) 
was added portionwise to 15 ml of 70% HNO3 in water bati, and keep 
the reaction temperature at tiie room temperature. The reaction mixture 
was Stirred at room temperature for 30 min and 30 g of crushed ice was 
added. This mixtiire was extracted witii 100 ml of dichlorometfiane and 
the organic phase was washed witii saturated sodium bicarbonate 
solution. The solution was dried over sodium sulfate and the solvent 
was removed in vacuo. The crude mixture was applied to a silica gel 
column with gradient metfianol in methylene chloride to yield 3 8 g 
(81.5%) of desired product 4-(3-acetoxypropoxy)-3-methoxy-6- 



20 
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nitroacetophenone and 0.38 g (8%) of ipso-substituted product 5.(3 

acetoxypropoxy)-4-methoxy-1,2-dinitrobenzene. 

Side ipso-substituted product 5-(3-acetoxypropoxy).4-methoxy.1 2 
dinitrobenzene: 

R,"0.47 (dichloromethane/methanol, 99/1). 

UV (methanol), maximum: 334, 330, 270, 240 212 nm. »• • 

282, 263, 223 „m. 

'H NMR (COa,) i 7.36 (s, 1H), 7.34 (s, 1H|, 4.28 (,, 2H) 4 ,8 ,t 2H> 
4 02 (s, 3H), 2.20 (m, 2H), 2.08 (a, 3H) 
10 "C NMR iCOat i ,70.9. ,62.2, ,S,.,, „7.6. ,,,.2 ,07 9 ,07 , 
66.7, 60.6, 56.9, 28.2, 20.9. ' 

rj,-o.29 (dichloromethane/methanoi, 99/1). 

IB 27V™* nm"' ""'"^"^ ^' 

y NMR ,COa,, 6 7.62 (a, ,H), 6.74 (a. ,H), 4.28 (t 2H), 4 20 (, 2H) 
3.96(a,3H),2.48,a,3H),2.20,m,2H,,2.08(a,3H) ^'^^ 
"C NMR (COa,) ,200.0. ,7,.0, ,B4.3, ,48.8, ,38.3, ,33 O ,08 8 
108.0,66.1,60.8,56.6,30.4,28.2,20.9. ^^O- 108.8, 

!.„ J'*'"^'"'"'^'*°'^'-'^»*»'^-e-""«'Phe„y,)e*a„ol 
4-(3-Ace,oxypropoxy).3^,hoxy.6-m,roaca,ophe-„one (3.73 g 12 
mmol) waa added 1 50 ml ethanol and e.B g of IC,CO Th. ^-^ 
stirred at room temperature for 4h and TLC wfth 5% methanol in 

d.oh.„ome*aneindioa.ed the completion of the reaction. To this same 
reason m.>«ure. i, Was added 3.S g of NaBH. and the mi«ure was 
s^rxed at room temperature for 2h. Acetone ,10 ml, was added to react 
w^h the rema.„,.NaBH.. The solvent was removed in vacuo and the 
«.due was upta^en into 50 g of silica gel. The silica gel mixture was 
appl. d on the top o, a sHica gel column with 5% methanol in methWene 
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10 



chloride to yield 3.15 g (97%) of desired product 1-(4-(3- 
hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanol. 
Intermediate product 4-(3-hydroxypropoxy)-3-methoxy-6- 
nitroacetophenone after deprotection: 
R,=0.60 (dichloromethane/methanol, 95/5). 

Final product l-(4.(3-hydroxypropoxy)-3-methoxy-6-nitrophenyOethanol: 
Rf-0.50 (dichloromethane/methanol, 95/5). 

UV (methanol), maximum: 344, 300, 243, 219 nm: minimum- 317 
264, 233 nm. * ' 

'H NMR (DMSO-d«) 6 7.54 (s, 1H), 7.36 (s, 1H), 5.47 (d, OH) 5 27 (m 
1H), 4.55 (t, OH), 4.05 (t, 2H), 3.90 (s, 3H), 3.55 (q, 2H), 1.88 (m 2H) 
1-37 (d,3H). 

"C NMR (DMSO-de) 6 153.4, 146.4, 138.8, 137.9, 109.0, 108 1 68 5 
65.9,57.2,56.0,31.9,29.6. ' *' 

nitrophenyDethanol ' 

1 -<^(3-Hydroxypropoxy)-3-methoxy-6-nitropheny|)ethanoI (0.325 
1.2 mmol) was co-evaporated with anhydrous pyridine twice and 
dissolved in 15 ml anhydrous pyridine. The solution was cooled in ice- 
20 water bath and 450 mg (1 .33 mmol) of DMTCI was added. The reaction 
mixture was stirred at room temperature overnight and 0.5 ml methanol 
was added to stop the reaction. The solvent was removed in vacuo and 
the residue was co-evaporated with toluene twice to remove trace of 
pyridine. The final residue was applied to a silica gel column with 
25 gradient methanol in methylene chloride containing drops of tnethylamine 
to yield 605 mg (88%) of desired product 1 -(4-(3-0-4,4'- 

dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)ethanol. 
R, = 0.50 (dichloromethane/methanol, 95/5). 
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UV (methanol), maximum: 354, 302, 282, 274 233 onQ • - 

, ^/H^, ^jj^ f^f^. niinimum: 

322, 292, 263, 222 nm. 

•H NMR (DMSO-de) ^ 7.54 (s, 1H), 6.8-7.4 (ArH), 5.48 (d, OH) 5 27 (m 
1H), 4.16 (t, 2H), 3.86 (s, 3H), 3.72 (s, 6H), 3.15 (t, 2H), 1.98 (t 2H) ' 
5 1.37 (d,3H). 

"C NMR (DMSO-de) ^ 157.8, 153.3, 146.1, 144.9, 138.7. 137 8 
135.7, 129.4, 128.7, 127.5, 127.4, 126.3, 112.9, 112.6, 108 9' 
108.2, 85.1, 65.7, 63.7, 59.2, 55.8, 54.8, 29.0, 25.0. 

10 ^' \"*^*?"°"^''*'-'^''"^*o*Vtritylpropoxy)-3-methoxv-6- 

^° nitrophenyl)-1-0.((2-cyanoethoxy). ^ 

diisopropylamlnophosphlno)ethane 

1 -(4-(3-0-4,4'-Dimethoxytritylpropoxy)-3-methoxy-6- 
nitrophenyDethanol (200 mg, 3.5 mmol) was dried under high vacuum 
and was dissolved In 15 ml of anhydrous methylene chloride. To this 
15 solution, it was added 0.5 ml N,N-diisopropylethylamine and 0.2 ml (O 89 
mmol) of 2-cyanoethy(-N,N-diisopropylchlorophosphoramidite. The 
reaction mixture was stirred at room temperature for 30 min and O 5 ml 
of metiianol was added to stop tfie reaction. The mixture was washed 
wrth saturated sodium bicarbonate solution and was dried over sodium 
20 sulfate. The solvent was removed in vacuo and a quick silica gel column 
with 1 % methanol in metiiylene chloride containing drops of 
trietfiylamine yield 247 mg (91 .3%) the desired phosphoramidite 1-(4-(3- 

0-4,4'-dimetiioxytritylpropoxy)-3-methoxy-6-niti-ophenyl).1.0-((2- 
cyanoethoxy)-diisopropylaminophosphino)ethane. 
25 R,=0,87 (dichloromethane/methanol, 99/1). 
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EXAMPLE 36 



Oligonucleotide synthesis 

The oligonucleotide conjugates containing photocleavable linker 
were prepared by solid phase nucleic acid synthesis (see: Sinha et al 
5 a^^ahedrsnX^ 1983, 24, 5843-5846; Sinha ^ aL Nusteic^V^ii^ 
1984, 12, 4539-4557; Beaucage et aL Tg^afagdron ^9^3^^~^^^^ 
6194; and Matteucci aL ^. Am. Chem Sor. 1981, 103^ 3185-3191, 
under standard conditions. In addition a longer coupling, time period was 
employed for the incorporation of photocleavable unit and the 5' terminal 
am.no group. The coupling efficiency was detected by measuring the 
absorbance of released DMT cation and the results indicated a 
comparable coupling efficiency of phosphoramidite 1 -(2-nitro-5-(3-0 4 4' 
dimethoxytritylpropoxy)phenyl)-1-0-((2-cyanoethoxy)- 
dilsopropylaminophosphino)ethane or 1-(4-(3-0-4 4'- 

15 ^-ethoxytritylpropoxy,-3-methoxy-6-nitrophenylM-0-((2^^^^^ 

d..sopropylaminophosphino)ethane with those of common nucleoside 
Phosphoramodites. Deprotection of the base protection and release of 
the conjugates from the solid support was carried out with concentrated 
ammonium at 55 oc overnight. Deprotection of the base protection of 
other conjugates was done by fast deprotection with AMA reagents 
Purification of the MMT-on conjugates was done by HPLC (trityl-on)' 
using 0.1 M triethylammonium acetate, pH 7.0 and a gcadient of 
acetonitrile (5% to 25% in 20 minutes). The collected MMT or DMT 
protected conjugate was reduced in volume, detritylated with 80% 
25 aqueous acetic acid (40 min. O <»C), desalted, stored at -20oc 
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EXAMPLE 37 



Photolysis study 

In a typical case, 2 nmol of oBgonucleotide conjugate cont„„- 
Photocleavablo linker in 200^ distilled water v«,s i Jdia^ T^"^ 
5 waveleng«, UV lan,p (Blak Ray XX-,6 UV lamo ' ' 

Oa..el. C. a. a diatanee oMO cm .emiaJr^rrr ^ 

DH 7 n . <tntyl^ff) us-ng 0. 1 M triethylamntonium acetate 

PH 7.0 and a gradient ofacetonitrile. Analyse showed that the 

iZrr^ ^ — upon j: 

Equivalents 

Those skilled in the art will recognize, or be able to ascer, • 
no more than to«.„e expe„me„t3«o„. numerous 0^1^ """^ 

'5 apeciflc procedures described herein. Such e,uivelents 1 "1 

w.. the scope o, this inyention and are leredrinr 
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SBQOBNCB lOrSTING 

(1) GENERAL INPORMATION 

(i) APPLICMUT: 

(A) N7VME: SEQOENQM, INC. 

(c! SJ??f 'sai'S^r"^^^" ^^^^^ ^<-<^ 

(D) STATE: California 

(E) COONTRY: OSA 

(F) POSTAL CODE (ZIP) : 92121 

(i) INVENTOR/ APPLICANT: 

(A) NAME: Hubert Koster 

(B) STREET: 836 Via Mallorca Drive 

(C) CITY: La OTolla 

(D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 52037 

(i) INVENTOR/APPLICANT: 

<A) NAME: Kai Tang 

;2? ^521 Sumraerdale Rd #241 

(C) CITY: San Dieao 

(D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92126 

( i ) INVENTOR/APPLICANT : 

(A) NAME: r:ong-Jing Pu 

(B) STREET: 10615 Dabney Dr, #21 

(C) CITY: San Diego 

(D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92126 

(i) INVENTOR/APPLICANT: 

itl Carsten W. Siegert 

(B) STREET: Geilfitr. 42 

(C) CITY: 22303 Haznburxr 

(D) STATE: ««««uxH 

(D) COUNTRY: Germany 

(E) POSTAL CODE (ZIP) : 

( 1 ) INVENTOR/APPLICANT : 

(A) NAME: Daniel P. Little 

(B) STREET: 393 Glendale Lake Rd. 
tC) CITY: Patton 

(D) STATE: Pennsylvania 
W) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 18668 

(i) INVENTOR/APPLICANT: 

(A) NAME: G, Scott Higgins 

(B) STREET: Haselweg 1 

(C) CITY: 22880 Weidel 

(D) STATE: 

(D) COUNTRY: Gezmany 

(E) POSTAL CODE (ZIP) : 
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(i) INVENTOR/APPLICMIT: 

(A) KA ME: Andreas Braun 

(B) STREET: X3232 Benchley Road 

(C) Ciry: San Diego 
<D) STATE: California 

(D) CX>UNTRY: aSA 

(E) POSTAL CODE (ZIP) : 92130 

(i) INVENTOR/APPIilCANT: 

J»l Brigitte Damhof f er-Detaar 

;5! Haines St. #8-308 

(C) CrVYz San Diego 

(D) STATE: California 
(D) OODNTRY: USA 

<E) POSTAL CODE (ZIP) : 92109 

(i) INVENTOR/APPLICANT: 

(A) NAME: Christian Jurinke 

(B) STREET: Grope Hall 68 

(C) CITY: 22115 Hamburg 

(D) STATE: 

(D) OODNTRY: Germany 

<E) POSTAL CODE (ZIP) : 

(i) INVENTOR/APPLICANT: 

(A) NAME: Dirk Van den Boom 

(B) STREET: Forsthausstr . 8 
CC) CITY: 633303 Preiech 
(D) STATE: 

(D) OOONTRY: Germany 

(E) POSTAL CODE (ZIP) : 

(i) INVENTOR/APPLICANT: 

(A) NAME: Goubing Xiang 

^2^^- ^^3®^ Zapata Ave. 

(C) CITY: San Diego 

(D) STATE: California 
CD) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92126 

(11) TITI^ OP THB INVENTIOH: DNA DIAONOSTICS BASED OU MASS SPECTOOMBTOy 
(ill) NUMBER OF SEQO^CES : 320 

(iy) CORRESPONDENCE ADDRESS: 

(n\ SS™^^^^ Martin, Haller & McClain 

(B) STREET: 1660 Union Street "ccxain 

CC) CITY: San Diego 

CD) STATE: CA 

CE) COUNTRY: USA 

CF) ZIP: 92101-2926 

(▼> COMPUTER READABLE FORM- 
(A) MEDIUM TYPE: Diskette 

JSI SSI^^'™^- Compatible 

CC) OPERATING SYSTEM: DOS 

CD) SOFTWARE: PastSBQ Version l.S 

Cvi) CURRENT APPLICATION DATA: 
CA) APPLICATION NUMBER: 
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(B) PILING DATE: 06-IIOV-1997 

(C) CLASSIFICATION: 

^"^/^l f^°^ APPLICATION DATA: 

(A) APPLICATION NOMBBR: 

(B) PILING DATE: 10/08/97 

(vii) PRIOR APPLICATION DATA: 

S "CATION NUMBER: 08/933,792 

(B) PILING DATE: 09/19/97 

^"^il APPLICATION DATA: 

^"CATION NOMBER: 08/787,639 
(B) FILING DATE: 01/23/97 

^^/xi f*^®*^ APPLICATION DATA: 

IbI ^f"CATION NUMBER: 08/78€,988 
(B) FILING DATE: 01/23/97 

'""/^J fSi?'* APPLICATION DATA: 

APPLICATION NUMBER: 08/746,055 
(B) PILUJO DATE: 11/06/96 

(vii) PRIOR APPLICATION DATA: 

{b ''°'«BER: 08/746.036 

(B) PILING DATE: 11/06/96 

*"^aI APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/744,590 

(B) FILING DATE: 11/06/96 ' 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/744 481 

(B) FILING DATE: 11/06/96 

INFORMATION: 
JA) NAME: Seidman, Stephanie L 
tnl S5?£f^TION NUMBER: 33,779 

(C) REFERENCE/DOCKET NUMBER: 73S2-2004PC 

'^S?S2f?*™"^<^I«»' INFORMATION: 
(A) TBLEPHQKB: 619-238-0999 

S*^™^' 619-238-0062 
tC) TELEX: 



(2) 



INFORMATION FOR SEQ ID NO:l; 



(i) SEQUENCE CHARACTERISTICS: 

(B) T»E: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

i^^i^l ^tVOX^BCOLS TYPE: cDNA 
(iii> HYPOTHETICAL: NO 
(IV) ANTISENSE: MO 
(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
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<3CAAGTGAAT CCTGAGCGTG 

(2) INFORMATION FOR SEQ ID NO:2r 
<i) SEQUENCE CHARACTERISTICS- 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: unknown 

Jff?^MOLECOIiE TYPE: cDNA 
Jill) HYPOTHETICAL: NO 

(IV) ANTISENSE: KO 

(V) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
<^TGTGMiGGG TTCATATGC 

(2) INFORMATION FOR SEQ ID NO: 3: 

(1) SEQUENCE CHARACTERISTICS- 
R 1^^' 2^ pairs ' 

i^l nucleic acid 

;C) STRANDEDNESS: single 
(D) TOPOLOGY: unknot 

5H!,^^CmE TYPE: cDNA 

(III) HYPOTHETICAL: NO 

(IV) ANTISENSE: NO 
(v) FRAOIENT TYPE- 
(VX) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATCTATATTC ATCATAGGAA ACACCACA 

(2) INFORMATION FOR SEQ ID NO: 4: 
^^ISEQUmCB CHARACTERISTICS- 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iil) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
C^) FRAOffiNT TYPE: 
(VI) ORIGINAL SOtJRCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:4: 
OTATCTATAT TCATCATAGG AAACACCATT 

(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(B) TYPE: nucleic acid 
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28 



30 
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(C) STRANDEDNESS : single 

(D) TOPOIOGY: unknown 

(ii) MOLECOLE TYPE: cDNA 

(iii) HYKXTHETICAL: NO 

(iv) ANTISEMSE: KO 

(v) FRACTIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
<5CTTTGGGGC ATGGACATTC3 ACCCGTATAA 

30 

(2) INFORMATION FOR SEQ ID NO:€: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOUXSfY: imknown 

(ii) MOLECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINTkL SOURCE: 

(xi) SEQUENCE DESCRIPTIW: SEQ ID NO: 6: 
CTGACTACTA ATTCCCTGGA TGCTC3GGTCT 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUE NCE CHARACTERISTICS- 

(A) LENGTH: 20 i>ase pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOIHBTICAL: NO 

(iv) ANTISBNSE: IK) 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TTCCCTGAGT GCAGTATOGT 

20 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(lli) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 



30 
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(v) FRAGMENT TYPE- 
<vi) ORIGINAL SOTOCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO'S- 
^^'^^ATAT COGGAAGCCT 

(2) INFORMATION FOR SEQ ID NO .-9: 

*^(Af^^ CHARACTERISTICS: 
ttl iS^*^' 24 Haae pairs 

nucleic acid 
i^l STRANDBDNESS: single 
CD) TOPOI^: unknot 

(ii) TOLECOLE TJTPE: cMIA 
<lii) HYPOTHETICAI.: NO 
(iv) ANTISENSE: NO 
(V) FRAOffiNT TYPE - 
(VX) ORIGINAL SODRCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ n, no-S- 
"««5CCACG CGGTTOGGAA TGTA 

(2) INFORMATION FOR SEQ ID NO: 10: 

(B, TVpfTnJLf^^L?-" 

(C STRANDEDNESS: Single 
(D) TOPOLOGY: unknown 

/ft!.'^25fS5!5_TYPE: cDNA 

HTfPOTHETICAL: NO 
(iy) ANTISENSE: NO 
JV) FRAiOfENT TYPE- 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:10- 
*«»*«3ACT GTTIXSCCCGC CAGTTC 

<2) INFORMATION FOR SEQ ID NO: 11: 

^^Lf^S^ CHARACTERISTICS- 
(A l^araTH: 25 base pairs 
(B TYPE: nucleic acid 
Jm S^ANDEDNESS: single 
(D) TOPOLOGY: unknown^ 



20 



24 



26 



<if) MOLECULE TYPE: cDNA 

(ly) ANTISENSE: NO 
«v) FRA01ENT TYPE- 
(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
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TACATTCCCA ACCX5CGTX3GC ACAAC 

25 

(2) INFORMATION FOR SEQ ID NO:12: 

(1) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<I>) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cONA 
(fii) HYPOTHETICAL: NO 
(iv> ANTISENSE: NO 

(v) FRAOfENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AACTGGCGGG CAAACAGTCG TTGCT 

25 

(2) INFORMATION FOR SEQ ID NO: 13: 

(1) SEQUE NCE CHARACTERISTICS* 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GCAAGTGAAT CCTGAGCGTG 

20 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unlcnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
<3T6TGAAGGG CGTG 

14 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS - 
(A) LENGTH: 24 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: xinknown 

(ii) MOIiECaiiE TYPE: cDMA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTATATTCAT CATAGGAAAC ACCA 

(2) INFORMATION FOR SEQ ID NO:l6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 18 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: tmknown 

(ii) MOLECULE TYPE: d^lA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
GTCACCCTCG ACCTCCAG 

(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IjENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA<a<ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TTGTAAAACG AOGGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
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(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTTCCACOGC GATGTTGA 

18 

(2) INFORMATION FOR SEQ ID NO: 19: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAGGAAACAG CTATGAC 

17 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) S3BQUENCE CHARACTERISTICS- 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJoiCwn 

(ii) MOLECULE TYPE: cONA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GTAAAACGAC GGCCAGT 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)cnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAC^fENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GTCACCCTCG ACCTGCAGC 



17 
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(2) INFORMATION FOR SEQ ID NO.-22: 

^^kf^?^^ CHARACTERISTICS: 
inl 20 base pairs 

fpANDEDNESS: single 
(D) TOPOUX3Y.. unknown 

Jf f '*°I'ECai.E TYPE: cDNA 
Jiii) HYPOTHETICAL: NO 
Uv) ANTISENSE: NO 
(V) PRACaiENT TYPE- 
(vx) ORIGINAL SOURCE: 

(Xi) SBOTENCE DESCRIPTION: SEQ ID NO: 22: 
^'"^AAAAC GAGGGCCAGT 

(2) INFORMATION FOR SEQ ID NO:23: 
fEOamCE CHARACTERISTICS - 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

Jfla.) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 

(v) FRAGMENT TYPE- 

(vi) ORIGINAL SOURCE: 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

'^^^^^ecCTGG TGCAGGGCCT ATTGTAGTTG TCACGTACA 

<2) INFORMATION FOR SEQ ID NO:24: 

<i) SEQUENCE CHARACTERISTICS - 

(B) TYPE: nucleic acid 
(C} STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

f^? WOI^CDLE TYPE: cDNA 
Jiil) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 
tv) FRAGMENT TYPE- 
Cvx) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:24- 
WSTACGTCAC AACT 



39 



(2) INFORMATION FOR SEQ ID NO:25: 

^^/>f^^^ CHARACTERISTICS: 
!b pairs 
(B) TYPE; nucleic acid 
JC) STRANDEDNESS: sinole 
(D) TOPOLOGY: unknown 
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(ii) MOLECDLE TYPE: cDHA 
(Xii) HYPOTHETICAL: NO 
(Iv) ANTISEKSE: NO 
<v) FRAGMENT TYPE- 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

(2) INFORMATION FOR SEQ ID NO:26: 
f^TJKNCB CHARACTERISTICS- 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

^.^P, ^^^CXnjB TYPE: cDNA 
(iix) HYPOTHETICAL: NO 

(iv) ANTISENSE: IK) 

(v) FRA)C9<ENT TYPE: 

(vi) ORIGINAL SOORCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
CTGATGCGTC GGATCATC 

(2) INFORMATION FOR SEQ ID NO: 27: 

^^Lf^^°™^ CHARACTERISTICS- 

(B) TYPE: nucleic acid 
- (C) STRANDEI»IESS : single 
(D) TOPOLOGY: unJcnown 

Jff >^«OI^CDLE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 

(iv) ANnSENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
GATGATCCGA CGCATCACAG CTC 

(2) INFORMATION FOR SEQ ID NO: 28: 
^^Lf^^^™ CHARACTERISTICS- 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)cnown 

f f f j . MOLECULE TYPE: cDNA 
Jlli} HYPOTHETICAL: NO 
(iy) ANTISENSE: NO 
(V) FRAGMEtrr TYPE: 
(vi) ORIGINAL SOURCE: 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
TCGGTTCCAA GAGCTX3TGAT GCGTCGGATC ATC 

(2) INFORMATION FOR SEQ ID NO: 29: 

(1) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: unknown 

(ii) MOLECULE TTPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

( v) FRA04ENT TYPE : 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GATGATxrOGA OGCATCACAG CTC 

<2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) M0LECUL2 TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA)C31ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GTGATGCGTC GGATCATC 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
TCX3GTTCCAA GAGCT 

(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOIiECDLE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQDENCE DESCRIPTION: SEQ ID NO: 31: 
^^^^^5GTTCCAA GAGCT 

15 

(2) INFORMATION FOR SEQ ID NO: 32: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEIHiESS : single 
<D) TOPOLOGY: untoo%m 

(ii) MOLECULE TYPE: cDNA 
(lli) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOORCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CATTTGCTTC TGACACAACT G 

. (2) INFORMATION FOR SEQ ID NO: 33; 

(i) SEQUENCE CHARACTERISTICS- 
> (A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unlcnown 

(ii) MOLECULE TYPE: cDNA 
(ili) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
(V) F7U104ENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
^^"WrPGTC TCCACATGC 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQU ENCE CHARACTERISTICS - 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: cDNA 
(a-li) HYPOTHETICAL: NO 



21 
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(ly) ANTISBKSE: NO 
(V) FRAOIENT TYPE- 
(Vi) ORIGINAL SOTOCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TGCACCTCSAC TC 

(2) INFORMATION FOR SEQ ID NO:3S: 
^^Lf^?^!^ CHARACTERISTICS: 

HH^" nucleic acid 
i^l STOANDEDMESS: single 
<D) TOPOLOOT: uakno*^^ 

;fji,'*£Jf9m£^TYPE: cDNA 
<1") HYPOTHETICAL: NO 
(ly) ANTISEMSE: NO 
jv) FRAOffiNT TYPE: 
(vi) ORIGINAL SOORCE: 

(Xi) SEQOENCE DESCRIPTION: SEQ ID NO SS- 
'^''^ACTTA ACCCAGTGTG 

(2) INFORMATION FOR SEQ ID NO: 36: 

'^kf^SSJ^ CHARACTERISTICS: 
(A) LEMSTH: 20 base Mirs 

ia\ S^f = lew 
- (C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(111) HYPOTHETICAL: NO 
Jiy) ANTISEHSE; NO 
<v) PRAOIENT TYPE: 
(Vi) ORIGINAL SOORCE ! 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOz36. 
CACACTATGT AATACTATSC 

(2) INFORMATION FOR SEQ ID NO: 37: 
*^(Af*5SiS CHARACTERISTICS: 

I^^'- nucleic acid 
Jm ^JR^NDEDMESS: single 
(D) TOPOLOGY: unlmown 

5f f ? . '*°I'ECDLE TYPE: cDNA 
)"f>,S[«»HETICAL: NO 
(ly) ANTISENSE: NO 
jv) FRAOIENT TYPE: 
(VI) ORIGINAL SOORCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
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GAAAATATCT GACA/kACTCA TC 

22 

(2) INFORMATION FOR SEQ ID NO:38: 

(1) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

- (C) STRANDBDNBSS: single 
(D) TOPOLOGY: tinknown 

(ii) MOLECOLE TYPE: cDMA 

(iii) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 

<v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: " 
CATGGACACC AAATTAAGTT C 

21 

(2) INFORMATION FOR SEQ ID NO:39: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

Xxi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
TGAGACTCTG TCTC 

14 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQU ENCE CHARACTERISTICS : 

(A) LENGIH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: imknom 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRACadENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
TTCCCCAAAT COCTC 

15 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: IS base pairs 
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(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : single 

(D) TOPOI/XSY: ^m)alo%m 

(ii) MOI*ECai*E TYPE: cDNA 

(iii) HYPOTHETICAIi: NO 

(iv) ANTISENSE: NO 

(v) FRAjC^IENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
GGCACGGCTG TCCAAGGAG 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single . - 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) PRAjOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
AGGCCGC6CT CGGOGCCCTC 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
(SOGOACATGG AGGACGTC 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) I«ENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CENA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
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(v) FRA04ENT TVPE: 

(vi) ORIGINAL SOURCE: 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GATGCCGATG ACCTGCAOAA G 

21 

.(2) INPOR^4ATION FOR SEQ ID NO:4S: 

(1) SEQU ENCE CHARACTERISTICS: 

(A) I*ENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECUIiE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
CXXTTTAOCCT TACCCTTACC CTAA 

24 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sincle 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: IK) 

(v) FRAOiENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
AATCOGTQCA GCAGAGTT 

(2) INFORMATION FOR SEQ ID NO:47: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(Iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
^'^^^^CAGAGCT GGACAAGTCT 

20 
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(2) INFORMATIOM FOR SEQ ID NO MB: 

'^kf^PSS CHARACTERISTICS.- 

iul iSS^' 20 base pairs 

J? I^^'' °«clelc acid 
- /«( fpANDEDNESS: sinole 

(D) IPPOLOGY.- unknown 

(iil) HYPOTHETICaL: MO 
(ly) ANTISEMSE: MO 

(V) pRAOffiNT TYPE: 

(VI) ORIGINAL SOORCE: 

SBQOENCE DESCRIPTION: SEQ ID NO: 48- 
GATATTGTCT TCCCGGTAGC 

(2) INFORMATION FOR SEQ ID NO: 49: 

*^}.f^^ CHARACTERISTICS: 
20 base pairs 
S S^' nucleic acid 
ml STRANDEDHBSS: single 
CD) TOPOLOGY: unknown 

Jfif) HYPOTHETICAL: MO 
Uy) ANTISENSE: MO 
(V) FRAGMENT TYPE: 
(vl) ORIGINAL SODRCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO -49. 
^^^^^fSGACCAG GTOTACCGCC 

(2) INFORMATION FOR SEQ ID NO: 50: ^ 

"<Af^Si^ CHARACTERISTICS: 
<B1 iS^' 21 base pairs 
J?i n««leic acid 

£?»NDEDNBSS: single 
(D) TOPOLOGY: unkno*^^ 

5ttf^S^'°™ETICAL: NO 
(iy) ANTISENSE: NO 
JV) FRAGMENT TYPE: 
<Vi) ORIGINAL SOORCE: 

(XI) SBQOENCE DESCRIPTION: SEQ ID NO-SO- 
«^I^CTQG AAGGCGATCT C 

(2) INFORMATION FOR SEQ ID NO:5l: 

'^Lf^H^MCE CHARACTERISTICS: 

(A) LaraTH: 20 base pairs 

(B) TYPE: nucleic acid 
JC) STRANDEDNESS: single 
(D) TOPOLOGY: unfcno%m 



20 
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<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL- NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE- 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:5l: 
CATGAGGCAG AGCATAOGCA 

20 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS - 
J«i iS^^' 20 base pairs 
(B) TYPE: nucleic acid 

(C STRANDEDNESS: single 
(D) TOPOLOGY: unknovm 

(ii) M3LECDLE TYPE: cDNA 

(3.XX) HYPOTHETICAL: NO - - 

(iV) ANTISENSE: NO 

(v) FRAC94ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
GACAGCAGCA CCGAGAOGAT 

(2) INFORMATION FOR SEQ ID NO:53: 
(i) SEQUENCE CHARACTERISTICS- 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 

(if) MOLECULE TYPE: cDNA 
(XXX) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: S3: 
^^^^^^<^TGCXSAT CACCGTX3CGG 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SBQtJraiCE CHARACTERISTICS- 

itl pairs 

IB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknovm 

^^^'BCUl^ TYPE: CI»IA 
(Xix) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE- 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 
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GATCCACTGT GCGACGAGC _ 

(2) INFORMATION FOR SEQ ID KO:55: ^ 

(1) SBQUEKCE CHARACTERISTICS: 
(A) IjBNGTH: base pairs 
iB) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECOIjE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISEKSE: NO 

(v) FRAiCSfENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GCGGCTGCGA TCACCGTGC 

19 

(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) IiENGTH: base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: un)aiown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TY:?H: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
TGCACCTGAC TC 

12 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IIENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: imknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
CTGTGGTCGT GC 

12 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 
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(D) TOPOIiOGY: unknown 

(ii) MOI-ECDI4E TYPE: cDNA 

(iii) HYPOOHETICAL: KO 
(XV) ANTISEHSE: MO 

(v) FRAGMENT OYPE: 
<vi) ORIGINAL SODRCE: 

(Xi) SEQUENCE DESCa^IPTION: SEQ ID NO:58: 
GAGTCAGGTG CGCX^ATGCCT CAAACAGACA CCATX3GOGC 
(2) INPORHATION FOR SEQ ID NO:S9: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 2X base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: iinfcnown 

(ii) MOLECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

(v) FRACMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
TCTCTGTCTC CACATGCCCA G 

(2) INFORMATION FOR SEQ ir NO:€0: 

(i) SEQUENCE CHARACTERISTICS- 

-CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)aiown 

(ii) MOI-ECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
ACCTiU^ CA<mca;,CT C^OATAATAC GACn^ 

/^t €1 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS- 

(A) I^HGTH: 21 base pairs ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unkno%m 

l^^l^^l^CaiSB TYPE: cDNA 
Clii) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
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(vi) ORIGINAI, SOURCE : 

(Xi) SEQOENTCE DESCRIPTION: SBQ ID NO: 61: 
AACTAAGCCA TGTGCACAAC A 

(2) INFORMATION FOR SEQ ID NO: 62: 

(1) SEQUENCE CHARACTERISTICS - 

(A) I^GTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOIiECOLE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
UCCGGUCUGA OOAGTCCGUG AGGAC 

25 

(2) INFORMATI<»I FOR SEQ ID NO: 63: 

(1) SEOTENCE CHARACTERISTICS- 

(B) TYPE: nucleic acid 

(C) STRANDEIMIESS: single 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: RNA 
("X) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GUCACOACAG GOGAGCUCCA 

20 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS- 
;»! 2^ *>ase pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 

(i±) MOLECULE TYPE: RNA 
(iii) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRA£3(BNT TYPE- 

(vi) CMaOINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
CAUGCGAGA GOAAGOAGUA 

(2) INFORMATION FOR SEQ ID NO: 65: 
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(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 50 base pairs 

(B) TyPE: nucleic acid 

(C) STRANDEDNESS : sinale 

(D) TOPOIiOGY: tmknoTO 

(ii) MOIiECOIiE TyPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOORCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
AGGCCUGCGG CAAGAOGGAA AGACCAUGGU CCCONAUCUG CCGCAGGAUC 
(2) INFORMATION FOR SEQ ID NO:66: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FTIAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CATTTGCTTC TGACACAACT 

(2) INFORMATION FOR SEQ ID NO:67: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: xinknown 

(ii) MOLECULE TYPE: cEWA 

(iii) HYPOTHETICAL: NO 
<1V) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 
''^<^'rCTGTCTC CACATGCCCA G 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS- 

(A) I^NGTH: 24 baise pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(P) TOPOLOGY: unkno%m 

(ii) MOLECULE TYPEz cDNA 
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(ixa) HYPOTHETICM,: NO 
(lyj ANTISKMSE: NO 

(V) FRAGMENT TYPE- 

(VI) ORIGINAI. SOORCE: 

(Xi) SEQDENCE DESCRIPTION.- SEQ ID NO:68 
^TCOTCCCAT QGTGCACCTG ACTC 

(2) INFORMATION FOR SEQ ID NO.-6S: 

m £?»»n>KDNESS.. single 
(D) TOPOIOGY: unknot 

(il> MOLECDLE TyPE.- cDNA 
(xii) HypOTHETICAL: NO 
Uy) ANTISENSE:1nS 
(V) FRAGMENT TYPE- 
(vi) ORIGINAL, SOURCE: 

(Xi, SEQUENCE DESCRIPTION: SEQ ID NO-SS- 
«5CTGTGGTG AGGCCCTSGQ CA 

(2) INFORMATION FOR SEQ ID NO:70: 

in £?»NDEDNESS: single 
(D) TOPOLOGY: unkno^^ 

(if) MOLECOLE TYPE: cDNA 
(xii) HYPOTHETICAL HO 
(ly) ANTISENSE: MO 
(y) FHAGMEHT TYPE: 
(Vi) ORIGINAL SOORCE: 

(Xi) SEQOENCE DESCRIPTION: SEQ ID NO:70. 
*°S*a»CT GCTACCTCAC TCCA 

(2) INFORMATION FX)R SEQ ID NO: 71: 

'"(Af^^.^^f^^STICS: 
roi t^Z^ ' base pairs 

/ni f?RANDBDNESS: single 
"» TOPOLOGY: unknown 

(Ui)'^SlS^™- *=DNA 

HYPOTHETICAL: NO 
(xy) ANTISBUSE: TO 
tv) FRAGMENT TYPE- 
(Vl) ORIGINAL SOURCE: 

(Xi) SEQOENCE DESCRIPTION: SEQ ID NO:7l: 
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ACAGCGGACT GCTACCTGAC TCCA 

(2) INFORMATION FOR SEQ ID NO: 72: 

(1) SEQOENCE CHARACTERISTICS: 

(A) I^QTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: unJcnown 

(ii) MOliECOIiE TYPE: CDNA 
(lii) HYPOTHETICAL: NO 

(iv) ANTISEHSE: NO 

(v) PRAG91ENT TYPE; 

(vi) ORIGINAL SOURCE: 

(Jci) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
TGGAGTCAGG TAGCAGTC 

(2) INFORMATION FOR SEQ ID NO: 73: 

(1) SEQOENCE CHARACTERISTICS- 
(A) LENGTH: 19 base pairs 
<B) TYPE: nucleic acid 

<C) STRANDEDNESS : sdLngle 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
CAGCTCTCAT, TTTCCATAC 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unlcnovm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAlSfENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
AGOCCCAAGA TOACTATC 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 



PCTAJS97/20444 



24 



18 



19 



18 



wo 98/20166 



-263 



(C) STRANDEDNESS : single 

(D) TOPOIOGY: unknom 

(ii) MOIiECDIiE TYPE: cDNA 

(iii) HYPOTHBTICMj: NO 

(iv) ANTISEHSE: NO 
(vj FRAOISNT TYPE: 
(vi) ORIGIN2VL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CX3AGGAGCTC AAGGCCAGAA T 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI»IESS : single 

(D) TOPOIiOGY: unknown 

(ii) MOIiECUIiE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMEt/rr TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
CAGGGGCAGC TCAGCTCTC 

(2) INPORMATICttI FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(±v) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
5GCA0GGCTG TCCAAGGA 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) I^NGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSfi: NO 

(v) FRAiCafBNT TYPE: 
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(vi) ORIGINAL SOURCE: 

(xi) SBQOENCE DESC3iIPTION: SBQ ID NO: 78 
AGGCCGCGCT CGGOGCCCTC 

20 

(2) INFORMATION FOR SEQ ID NO: 79: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STOANDEDNESS: single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(Hi) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
CTTACTTOAA TTCCAA6AGC 

(2) INFORMATION FOR SEQ ID NO:80: 

(i) SEQUENCE CHARACTERISTICS • 
itl fS!^- 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBEttlESS: single 

(D) TOPOLOGY: unknown 

^^^^l^^^COLE TYPE: cI»lA 
(lii) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 

(v) .FRAOfBNT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
GGGCTGACTT GCATGGACOG GA 

22 

(2) INFORMATION FOR SEQ ID NO:8l 

(i) SEQUENCE CHARACTERISTICS- 

it\ iSS^'' " pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOIHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAGMENT TYPE- 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
3CCAGGACA AG 
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(2) INFORMATION FOR SEQ ID NO:82: 

(1) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) - TOPOIOGY: unknown 

(ii) MOIiECUIiE TYPE: cDNA 

(iii) HVPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
ACAGCAGGAA CAGCA 

15 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: tankncnm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMEKT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
5CGGACATGG AGGAOGTG 

18 

(2) INFORMATION FOR SEQ ID NO: 84: 

(1) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIiOGY: unJcnom 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTOETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAlGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
ATGCOGATG ACCTGCAGAA G 

21 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA 
(xii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAC^IENr TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
GTGCCCTGCA GCTTCACTGA AGAC 

24 

(2) INPORHATION FOR SEQ ID NO:86: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknovfn 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
AGCCAGGACA AG 

(2) INFORMATION FOR SEQ ID NO: 87: 

fEQOENCE C3IARACTERISTICS - 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
AGCCAGGACA AGTC 

14 

(2) INFORMATION FOR SEQ ID NO:88: 
(i) SEQUENCE C3IARACTERISTICS • 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

^^^l ^IjBCOI^ TYPE: cDNA 
(ill) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAC9CENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
AGCCAGGACA AGA 

13 

(2) INFORMATION FOR SEQ ID NO: 89: 

(1) SEQDK NCE CHARACTERISTICS: 

(A) LENGTH; IS base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOIiOGY: iinknown 

(ii) MOLECOIiE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
ACAGCACCAA CAGCA 

15 

(2) INFORMATION FOR SEQ ID NO:90: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI»IESS : single 

(D) TOPOLOGY: unknown 

(iij MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
ACAGCAGGAA CAGCATC 

17 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 16 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: unkno%m 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEOTENCE DESCRIPTION: SEQ ID NO: 91: 
CAGCAGGAA CAGCAG 

16 
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(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOUXSYi unknown 

(ii) MOLECUIiE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v> FRAGMENT TYPE: 
(vi) ORIGINAL SOORCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
GCGGACATGG AGGAOGTX3 

18 

(2) INFORMATION FOR SEQ ID NO: 93: " - 

(1) SEQUENCE CHARACTERISTICS - 
(A) LENGTH: 20 base pairs 
(B TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: untoown 

(ii) MOLECULE TYPE: cDNA 
(111) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAG3-13NT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
GCGGACATGG AGGACGTGGC 

C2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAG»(ENT TYPE: 

(vi) . ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
<X3GACATQG AGGACGTCC 

19 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUE NCE CHARACTERISTICS- 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STOANDEDNESS: single 
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(D) TOPOI/X3Y: unknown 

Jiii) HyPOTHETICAL: KO 
(ly) ANTISEaiSE: NO 
(v) FRACMENT TYPE- 
(Vi) ORiaiHAl. SOXJRCE: 

(xi) SEQDBilCE DESCRIPTION: SEQ ID NO; 95: 
^^^"^'^^TG ACXrroCAQAA G 

21 

(2) INFORMATION TOR SEQ ID NO;96:- 

^"^fivf^PSS?^ CHARACTERISTICS: 
tu\ 22 base pairs 

i^l »"<=lei<= acid 

STRANDEMJESS: single 
<D) TOPOLOGY: unknown 

^.^^l TYPE: cDNA 

(xy) AMTISENSE: NO 
<V> FRAtaiENT TYPE: 
(vx) ORIGIKAL SOORCE: 

(Xi) SBQOENCE DESCRIPTION: SEQ ID NO: 96: 
GATGOCGATG ACCTGCAGAA GC 

(2) INFORMATION TOR SEQ ID NO: 97: 

^'^kf^S'S^ CHARACTERISTICS: 
ln\ 23 base pairs 

in\ S^?* nucleic acid 
m SS^^'^^S- single 
(D) TOPOLOGY: unknown 

5f H . TYPE: cDNA 

(lii) HYPOTHETICAI.: NO 
(iy) ANnSBNSB: NO 
(V) FRAGMENT TYPE: 
(Vi) ORIGINAI. SOORCE: 

(Xi) SEQOKMCE DESCRIPTION: SEQ ID NO: 97: 
SATGCCGATG ACCTGCAGAA GTG 

(2) INFORMATION TOR SEQ ID NO: 98: 

^^Lf^S^^*^ CHARACTERISTICS- 
Jri 24 base pairs * 

(B) TYPE: nucleic acid 

^TRANDEDNESS: Single 
(D) TOPOIiOGY: unknown 

i^^l ^f^^-ECOhB TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(iy) ANTISENSE: NO 
(v^l rRAfflfENT TYPE: 
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(vi) ORIGlNAIi SOURCE: 

(Xi) SBQPENCE DESCRIPTION: SEQ ID NO 198: 
<3TGCCCTGCA GCTTCACTGA AGAC 

(2) INFORMATION FOR SEQ ID NO: 99: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iil) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
<v} FRAG»<ENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GTGCCCTGCA GCTTCACTGA AGACTO 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnovm 

(ii) MOLECULE TYPE: cDNA 
(111) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: 
^^TCCCCTGCA GCTTCACTGA AGACC 

(2) INFORMATION FOR SEQ ID NO: 101: 

^^^.^f^^JENCE CHARACTERISTICS- 

/«; f?*^^^- *>ase pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

lflK**^^CULE TYPE: cDNA 
(lli) HYPOTHETICAL: NO 

(iv) ANTISENSE! NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TATCTC3TTCA CTTGTCCCC 
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(2) INFORMATION FOR SEQ ID NO: 102; 

(1) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOI/XSY: unknown 

(ii) MOI*ECOLE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISEHSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
CAGAGGCCTG GGGACCCTG 

19 

<2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOI£CUIfE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

ix±) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
AOGACAGGGC TGGTTCCC 

18 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknovm 

(ii) MOI*ECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRACMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 
ACTGACAACC ACCCTTAAC 

19 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) I£NGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEMIESS: single 

WPE: cDNA 
(fix) HYPOTHETICAL s KO 
(IVJ ANTISENSE: NO 
(V) FRAGMENT TYPE- 
(vi). ORIGliiAL SOORCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO.IOS: 
^^''^(SCTTGCCA CAGGTCTC 

(2) INFORMATION FOR SEQ ID NO: 106: 
^^m^S^ CHARACTERISTICS: 
(B) TYPE: nucleic acid 
i^l SpANDEDNESS: single 
(D) TOPOIiOOY: xmkn<Mn 

5ffK'*°^^^°^ TYPE: CDNA 
(iix) HYPOTHETICAL: NO 
(ly) ANTISENSE: NO 
(V) FRAGMENT TYPE- 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106- 
CACAGCAGGC CAGTG7X3C 

(2) INFORMATION FOR SEQ ID NO: 107: 

^^(Af^SiS CHARACTERISTICS: 
lAf LENGTH: 19 base oairs 

J« fpAHDEDNESS: single 
CD) TOPOLOGY: unknot 

(xxi) HYPOTHETICAL: NO 
(ly) ANTISENSE: NO 
(V) FRA(3<BNT TYPE- 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
SAOCTGATT TCCTTACTG 

(2) INFORMATION FOR SEQ ID NO: 108: 
^^(Af^^SI^ CHARACTERISTICS: 

irl V^^- *»"<:leic acid 
/m f^ANDKDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAlj SOURCE: 

(xi) SEQDENCE DESCRIPTION: SEQ ID NO: 108: 
TCAATCTGAG GCATAACTX3 

19 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SBQOKNCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) T03P0L0GY: un3aio%m 

(ii) MOI*ECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) raAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xl) SEQUENCE DESCRIPTION: SEQ ID NO:109: 
TTX3CGTACAC ACTGGC06TC GTTTTACAAC GTCGTCACTG GGAAAACCCT 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(P) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FIUkGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
JTAAAACGAC GGCCAGTCTG TAOGCAA 

27 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)cno%m 

(ii) MOXjECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMOn* TYPE: 

(vi) CmiGINAL SOURCE: 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
TACTGGAAGG CQATCTCAGC AATCAGC 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQDENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TyPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIiOGY: iinknown 

(ii) MOI^COLE TYPE: cDNA 
(ixi) HYPOTHETICAL: NO 
(iv) ANTISEMSE: NO 
<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE. DESCRIPTION: SEQ ID NO:112: 
GGCAOGGCTG TCCAAGGAG 

(189) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQU ENCE CHARACTERISTICS- 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(Xli) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
AOGCCGCGCT OGGCGCCCTC 

20 

<2) INFORMATION FOR SEQ ID NO: 114: 

<i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: RNA 
(ixi) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAC94ENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:XX4: 
30CACUACAG GOGAGCUCCA 



27 



(2) INFORMATION FOR SEQ ID NOrXXS; 
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<i) SEQO ENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOI/X5Y: unknown 

(ii) MOLECDLE TYPE: cDHA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISEMSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GAATTCGAGC TCGGTACCOG G 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQOENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: Cl3NA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAjCaiENT TYPE: 

(vi) ORIGINAL SOTRCE: 

(xi) SEQOENCE DESCRIPTION: SEQ ID NO: 116: 
OOGGGTACOG AGCTOGAATT C 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unkno«ni 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SBCKIENCE DESCRIPTION: SEQ ID NO: 117: 
^CTCTTGGGA ACTGTGTAGT ATT 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQO ENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unlcnown 

(ii) MOLECULE TYPE: cDNA 
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(lix) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) OR1GIN3U, SODRCE- 



49 



(Xi) SEQUENCE DESCRIPTION: SEQ IDNO-118- 

iSS? JSSSS? SSSS^ f^^^ 

(2) INFORMATION FOR SEQ ID NO: 119- 

(1) SBQDENCE C3IARACTERISTICS : 

itl iSf*^= pairs 
(B) TYPE: nucleic acS 
<C> STRANDEWIKSS: single 
<D) TOPOLOGY: unknot 

(ii) MOUECDLE TYPE: cMJA 
(a-ii) HYPOTHETICAL: NO 

(IV) ANTISENSE: NO 

(V) FRAGMENT TYPE- 
(vi) ORIGINAL SOORCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO-llS- 

T^TACGACT CACTATAGGG CGAAGGCXGT CTCTCTCCCT C..:tcaTAC 

(2) INFORMATION FOR SEQ ID NO:120: 

^^/nf^HF^*^ CHARACTERISTICS: 

pairs 

(B) TYPE: nucleic acid 
iSi STRANDEDNESS: single 
») TOPOLOGY: unkno%m 

^**>I^CatE TYPE: cDNA 
(llA) HYPOTHETICAL: NO 
(xy) ANTISENSE: NO 
(v) FRA01ENT TYPE: 
(Vi) ORIGINAL SODRCE: 

(Xi) SBQDENCE DESCRIPTION: SEQ ID NO- 120- 

^ SJSS sssss -SSS S^SSS ,1? 

(2) INFORMATION FOR SEQ ID NO: 121: 

^''^/^f^^*^ CaaRACTERISTICS: 
Ir ^2 base pairs 

HPE: nucleic acid 
(C STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECDLE TYPE: cMOV 

(iii) HYPOTHETICAL: NO 
(ly) ANTISENSE: NO 
(V) FRAGMENT TYPE : 
(va) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: iSEQ ID NO:121: 
AATACTACAC AG 

(2) INFORMATION FOR SEQ ID NO:X22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECDIiE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
CTGATGCGTC GGATCATCTT TTTT 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
QATGATCCGA CGCATCAGAA TGT 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: tmknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
JATCTAGCTG GGCCGAGCTA GGCCGTTGA 

(2) INFORMATION FOR SEQ ID NO: 125: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) I^GTH: 27 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIiOGY: unknown 

(ii) MOIiECDIjE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:125: 
CTGATGCGTC GGATCATCTT TT rrrrr 

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 12 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknovm 

TYPE: cDNA 
Jfii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
GATGATCCGA CG 

.(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: IS base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : sdLngle 

(D) TOPOLOGY: tmkno%m 

Jff •^IfiCULE TYPE: cDNA 
Ciil) HYPOTHETICAL: NO 
(Iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTICMJ: SEQ ID NO: 127: 
GATGATCCGA CGCAT 

(2) INFORMATION FOR SEQ ID NO: 128: 

f^^tJHNCE CHARACTERISTICS- 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)cno%m 

(ii) MOLECULE TYPE: cONA 
(ili) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 

(v) FRAOfENT TYPE: 

(vi) ORIGINAI- SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 



(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) XJSNGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRTINDEDNESS : single 

(D) TOPOIiOGY: unkno%m 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAiOfBNT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 



(2) INFORMATION FOR SEQ ID NO:130: 

(i) SEQUE NCE CHARACTERISTICS: 
(A) LENGTH: 253 base pairs 
xB) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FTtAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;130: 



(2) INPORMATICW FOR SEQ ID NO: 131: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) I^NGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)cno%m 

(ii) M0I£CULB TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



AT^AAAAGATG AT 



12 



GATCCGACGC AT 



12 




ACATGGAGGA 60 
GCCAGAGCAC 120 
GGCTCCTCCG 180 
GCGAGGGCGC 240 
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(xi) SBQOENCE DESCRIPTION: SEQ ID NO: 131: 

OAATTAOvoT cccAAcoGCG a^ou^o. Ac^ccGoc AA;vavcaxx.T roc^^ se 

(2) INFORMATION FOR SEQ ID NO: 132: 
f^UEl^aE CHARACTERISTICS: 

(B) TYPE: nucleic acid 
i^l 2?^ANDEDNESS: single 
(D) TOPOLOGY: unknown 

l^^l^^^CXlhB TYPE: cDNA 
(Xll) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAOIENT TYPE- 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO-132- 

AccArr^o ^r^ra^r xccTAo^a,, aatatag^uvc cgtov^ 

(2) INFORMATION FOR SEQ ID NO: 13 3: 
(i) SBQOENCE CHARACTERISTICS - 
(B) TYPE: nucleic acid 

/n S^5*®™^S= single 
(D) TOPOI«3y. unknown 

(il) MOLECOLE TYPE: cDHA 
(lii) HYPOTHETICAL: NO 
(ly) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAI. SOURCE: 

(Xi) SBQOENCE DESCRIPTION: SEQ ID NO: 13 3: 
CTATATTCAT CATAGGAAAC ACCAAAGAT 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SBQOENCE CHAHACTBRISTICS • 
5^5™= 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIOGY: unknown 

Jjji,»<^^^TYPE: CDNA 
(iii) HYPOIHETICAL: NO 
(ly) ANTISENSE: NO 
(V) FKAGMBNT TYPE- 
(vi) ORIGINAI. SOURCE: 

(xi) SEQDENCE DESCRIPTION: SEQ ID NO:134: 
TATATTCAT CATAGGAAAC ACCAAT 

2€ 

(2) INFORMATION FOR SEQ ID NO: 135: 
<i) SEQUENCE CHARACTERISTICS: 
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(A) liENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOIiOGY: unknonni 

(ii) MOIiECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTXSENSE: NO 

(v) FRAiOfENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQX7ENCE DESCRIPTION: SEQ ID NO: 135: 
CTATATTCAT CATAGGAAAC ACCAAAGAT 

(2) INFORMATION FOR SEQ ID NO: 136: 

(1) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: Cl»IA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 
-TATATTCAT CATAGGAAAC ACCAAAGATG ATATTTTC 

(2): INFORMATIC»J FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEt»IESS : single 

(D) TOPOLOGY: un)aiown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
rATATTCAT CATAGGAAAC ACCAATC ATATTTTC 

(2) INFORMATION FOR SEQ ID NO:138: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEC3NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 



35 



WO 98/20166 

PCrA;S97/20444 



-282- 

(iy) ANTISKMSE: NO 
(V) FRAGMENT TyPE- 
(Vi) ORIGINAI, SOTRCE: 

(Xi) SEQDENCE DESCRIPTION: SBQ ID NO: 138. 
CTATATTCAT OITAGGAAAC ACCAAAGATA TTTTC 

(2) INFORMATION TOR SEQ ID NO: 139.- 

^^(Af^iS <=«»RACTERISTICS: 
J a; I^NGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDHJNBSS: sinole 

(D) TOPOLOGY: unknoi^ 

<ii> MOLBCDLE TYPE: cDNA 
<lll) HYPOTHETICAI,: NO 
(ly) MJTISENSE: NO 
(V) PRAOfflSNT TYPE- 
(Vi) ORIGINAL SOURCE: 

(Xi) SBQDKNCE DESCRIPTION: SEQ ID NO-iag- 
CTATATTCAT CATAGGAAAC ACCAAAGATC C 

(2) INFORMATION FOR SEQ ID NO: 140: 

^"^rAf^PSSi^ CHARACTERISTICS: 
(A) LENGTH: 200 base pairs 

£?RAM>EDNESS: single 
(D) TOPOLOGY: unknoTO^ 

JffJ.'^SifS^TYPE: CDNA 
(lil) HYPOTHETICAL: NO 
Jir*.^^^^SE: NO 
(VJ FRAGMENT TYPE- 
(VI) ORIGINAL SOURCE: 

<Xi) SBQOENCE DESCRIPTION: SEQ ID NO:140: 

(2) INFORMATION FOR SBQ ID NO:141: 

*^(Af*^JS CHARACTERISTICS: 
/B> i^^'' *5 pairs 
Jol IH^- nucleic acid 

fpANDHDNESS: Single 
(D) TOPOLOGY: unknown 

(if) MOLECOLE TYPE: CMIA 
(fxi) HYPOTHETICAL: ^O^ 
(xy) ANTISENSE: NO 

(V) FRAGMENT TYPE- 

(VI) ORIGINAL SODRCB- 
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(Xi) SEQOENCE DESCRIPTION: SEQ ID NO: 141- 

SSSS?^ ^SSr "«T««c 

(2) INFORMATION FOR SEQ ID NO: 142: 

*^Lf^H^!l^ CHARACTERISTICS: 
it\ IS pairs 

= nucleic acid 
fTRANDBDNESS; single 



- • — 'u'w/nooa: sine 

(D) TOTOLOGFY: tinknown" 

Jff ? .'^^^COI^ TyPE: cDNA 
(lli) HYPOTHETICAL: NO 
(xv) ANTISENSE: NO 

(v) FRAOIENT TYPE- 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 
TOTACGTCAC AACTA 

(2) INFORMATION FOR SEQ ID NO:143: 

^^kf^S^ CHARACTERISTICS: 
Joi iSf^- pairs 
(B TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown^ 

Jff],**^I^CDLE TYPE: <a3NA 
Uli) HYPOTHETICAL: NO 
(xy) ANTISENSE: NO 
(vj FRAGMENT TYPE- 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
^<5TAOGTCAC AACTAC 

(2) INFORMATION FOR SEQ ID NO: 144: 

^"^Lf^PSS CHARACTERISTICS: 
inl l^^' pairs 
inl I^^' nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)aio%m^ 

TYPE: CDNA 
JxXi) HYPOTHETICAL: NO 
(xy) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vx) <miGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
TAOGTCAC AACTACA 

(2) INFORMATICS FOR SBO ID NO-wc 
(X) SEQUENCE CHARACTTOICTI^: ^ " = 
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(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STliANDEDNKSS : single 

(D) TOPOrOGY: un)cno%m 

(ii) MOLECOLE TYPE: cDNA 
Uli) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SODRCE: 

(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 145: 
TOTAOGTCAC AACTACAA 

(2) INFORMATION FOR SEQ ID NO: 146: 

(1) SEQUENCE CHARACTERISTICS- 

<B) TYPE: nucleic acid 
(C STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

iff ^.'^^^COLE TYPE: cDNA 

(III) HYPOTHETICAL: NO 

(IV) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
TGTACGTCAC AACTACAAT 

(2) INFORMATION FOR SEQ ID NO: 147: 
(i) ^SEQUENCE CHARACTERISTICS- 

(B TYPE: nucleic acid 
(C STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

, TYPE: craiA 

(ill) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 
TGTAOGTCAC AACTACAATA 

(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS- 
5»! iSf^- 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sinale 

(D) TOPOLOGY: unknown 

TYPE: C!DNA 
(111) HYPOTHETICAL: NO 
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(iv) ANTISEMSE: NO 

(v) FRA01ENT TYPE: 

(vi) ORIGINAL SOORCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 
TGTAOGTCAC AACTACAATA G 

(2) INFORMATION FOR SEQ ID NO:149: 

(1) SEQD BNCE CHARACTERISTICS: 

(A) I-ENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHBSS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECOLE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
ICTACGTCAC AACTACAATA GG 

(2) INFORKATION FOR SEQ ID NO: 150: 

(i) SBQOS NCE CHARACTERISTICS - 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un}cno%m 

(ii) MOLECULE TYPE: cDNA 
(Hi) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DBSCRIPTIC»r: SEQ ID NO: 150: 
3TACGTCAC AACTACAATA GGC 

(2) INFORMATK^ FOR SEQ ID NO: 151: 

(i) SEQUE NCE CHARACTERISTICS- 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)cnown 

(ii) MOLECULE TYPE: cI»IA 
(ili) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMSIT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTIW: SEQ ID NO: 151: 
rAOGTCAC AACTACAATA GGCC 
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(2) INFORMATION FOR SEQ ID NO: 152: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 
XD) TOPOIiOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOiENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152; 
TGTACGTCAC AACTACAATA GGCCC 

(2) INFORMATION FOR SEQ ID NO: 153: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknoim 

(ii) MOLECULE TYPE: cDNA 
(lli) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 
TGTACGTCAC AACTACAATA GGOOCT 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEMIESS: single 

(D) TOPOLOGY: un)mo%m 

(ii) MOLECULE TYPE: cDNA 
(Ail) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOffiNT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:154: 
raTACGTCAC AACTACAATA GGCCCTG 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDETttlESS : single 
(U) TOPOLOGY: unknown 
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(ii) fKJLECOLE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRACTIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 155: 
TGTACGTCAC AACTACAATA GGCCCTCC 

28 

(2) INFORMATION FOR SEQ ID NO:X56: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cWA 
(ili) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
TGTACGTCAC AACTACAATA GGOCCTGCA 

29 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unlcnovm 

(ii) MOIfCULE TYPE: cONA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOfENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 
rOTACGTCAC AACTACAATA GGCCCTOCAC 

30 

(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: cTOlA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAiOCENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 15 8: 
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TGTACGTCAC AACTACAATA GGCCCTGCAC C 

(2) INFORMATION FOR SHQ ID NO: 159: 

(1) SEQUENCE CHARACTERISTICS- 
tt\ ^2 base pairs 
(B) TYPE: nucleic acid 

In S^S^™*^S: single 
(D) TOPOLOGY: unknown 

5 ' •'OI^CDLE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE- 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 
TGTACGTCAC AACTACAATA GGCCCXX3CAC CA 

(2) INFORMATION FOR SEQ ID NO:160: 

(1) SBQOEKCE CHARACTERISTICS: 
Ir ^3 *«se pairs 
inl I^^' acid 

(C) STRANDEDMESS: single 

(D) TOPOLOGY: unknown 

{ffL"°^^^°^ TYPE: CDNA 
Jiii) HYPOTHETICAL: NO 
Uv) ANTISENSE: KO 
(V) FRAOffiWr TYPE- 
<vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 160- 
TGTACGTCAC AACTACAATA GGCCCTCCAC CAG 

(2) INFORMATION FOR SEQ ID NO: 161: 

*^iaf^J^^ CHARACTERISTICS: 
S! ^I^^'- ^* pairs 
Si IB^- nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

TYPE: CDNA 
(ill) HYPOTHETICAL: NO 
(ly) ANTISENSE: NO 
(V) FRAGMENT TYPE- 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 
GTACCTCAC AACTACAATA GGCCCTGCAC CAGG 

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS - 
i^?™= 35 base pairs * 
(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOI/XTY: unknovm 

(ii) MOIiECOLE TYPE: CDKTl 

(iii) HYPOTHETICAL: NO 

(iv) ANTXSEMSE: NO 

(v) FRAOiENT TYPE: 

(vi) ORIGIKAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 
TGTAOGTCAC AACTACAATA GGCCCTGCAC CAGGC 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SBQOK NCE CHARACTERISTICS: 

(A) IjENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIiOGY: unknown 

<ii> M0I<BCQI<E TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISraiSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
TGTACGTCAC AACTACAATA GGCCCIXSCAC CAGGCC 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: SjLngle 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
OGTAOGTCAC AACTACAATA GGCCCXX3CAC CAGGCCA 

(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)cnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FTUVGMBNT TYPE: 
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(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 
TCTAOGTCAC AACTACAATA QGCCCTC3CAC CAGGCCAG 

(2) INFORMATION FOR SEQ ID NO: 166: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: cDNA 
Uxi) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 
TGTAOGTCAC AACTACAATA GGCCCTGCAC CAGGCCAGA 

39 

(2) INFORMATION FOR SEQ ID MO: 167: 

(1) SEQUENCE CHARACTERISTICS - 

/ni i^^- pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(j-ii) HYPOTHETICAL: NO 
(iy) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 
CTGATGOGTC GGATCATCC 

(2) INFORMATION FOR SEQ ID NO: 168: 

<i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(lii) HYPOIHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINTIL SOURCE: 

(xi) SBQOTNCE DESCRIPTION: SEQ ID NO:168: 
^TCATGOGTC GGATCATOCA 

(2) INFORMATION FOR SEQ ID NO: 169: 



19 



wo 98/20166 



PCT/US97/20444 



-291- 



(i) SEQOEKCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: unknown 

(ii) MOI*ECUIiE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:169: 
CTGATGCGTC GGATCATCCA G 

(2> INFORMATION FOR SEQ ID NO:170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAC91ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 
CTGATGCGTC GGATCATCCA GC 

(2) INFORMATION FOR SEQ ID NO:171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLCX3Y: unlcnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTIS3ENSE: NO 

(v) FRA<3MENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 
CTGATGCGTC GGATCATCCA GCA 

(2) INFORMATION FOR SEQ ID NO:172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGa^ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 
CTGATGCXSTC GGATCATCCA GCAG 

24 

(2) INFORMATION FOR SEQ ID NO: 173: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: cDNA 
Uli) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:173: 
CTGATGCGTC GGATCATCCA GCAGC 

(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
CTGATGCGTC GGATCATCCA GCAGCA 

26 

(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 
it\ 27 base pairs 

(B) . TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: xmknown 

(ii) MOLECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) PRAC»1ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:175: 
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CTGJATGCX5TC GGATCATCCA GCAGCAG 

(2) INFORMATION FOR SEQ ID NO: 176: 

(1) SEQUENCE CHARACTERISTICS: 

(A) liENGTTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: iinknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOIHETTCAL: NO 

(iv) ANTISENSE: NO 

(v) FRAiG^iENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 
CTGATGCGTC GGATCATCCA GCAGCAGC 

(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SBQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAiC^IENT TYPE: 

(vi) ORIGINAL SOtJRCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 
CTGATGCGTC GGATCATCCA GCAGCAGCA 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQU KNCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOfENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 
:nX3ATGCGTC GGATCATCCA GCAGCAGCAG 

(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOIiOOT: unknown 

fff> MOI^CDLE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 
CTGATGCGTC GQATCATCCA GCAGCAGCAG C 

31 

(2) INFORMATION FOR SEQ ID NO: 180: 

(1) SEQUENCE CHARACTERISTICS- 
<A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDMA 
(Hi) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:180: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CA 

(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

Jff> MOI^CULE TYPE: cDMA 

(iii) HYPOTHETICAL: 1K> 

(iv) ANTISENSE: NO 

(v) FRAG^4ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAG 

(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 
CTGATGCGTC QGATCATCCA GCAGCAGCAG CAGC 

(2) INFOR^[ATION FOR SEQ ID NO: 183: 

(1) SEQUENCE CHARACTERISTICS- 

(A) I-ENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: cDNA 
(iix) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vl) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCA 

(2) INFORMATION FOR SEQ ID NO: 184: 

(1) SEQUENCE CHARACTERISTICS- 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

Jff?,*^^COLE TYPE: cDNA 
UIX) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 

(v) FRAGMENT TYPE- 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAG 

(2) INFORMATION FOR SEQ ID NO: 185: 
^^Lf^"^^ CHARACTERISTICS: 

itl pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRACa^ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 
:TX3ATG0GTC GGATCATCCA GCAGCAGCAG CAGCAGC 

(2) INFORMATION FOR SEQ ID NO:186: 
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(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 38 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; unJcnown 

(ii) MOLECULE TYPE: cDNA 
Uxi) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:186: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCA 

(2) INFORMATION FOR SEQ ID NO: 187: 
^^/^f^"^^^ CHARACTERISTICS: 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

i^^l^^^COLE TYPE: cDNA 
(ili) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRACMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION. SEQ ID NO:187: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAG 

(2) INFORMATION FOR SEQ ID NO: 188: 

^^Lf^"^^ CHARACTERISTICS: 
m " pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

^^^i J^l^CaiM TYPE: CDNA 
Jiii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE- 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:188: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC 
(2) INFORMATION FOR SEQ ID NO:189: 
(i) SEQUENCE CHARACTERISTICS- 

in\ I^^' '^^<=l«ic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cI»lA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOtJRCE: 

(xi) SBQtJENCE DESCRIPTION: SEQ ID NO: 189: 
CTQATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC A 
(2) INPORHATION FOR SEQ ID NO: 190: 

(1) SEQDE NCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOXiECDLE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:190: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AG 
(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQOK NCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(if) MOhECXTLE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAG»4BNT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 191: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGT 

(2) INFORMATION FOR SEQ ID NO: 192: 

(1) SEQUENCE CHARACTERISTICS: 

(A) IiBNGTB: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:192: 
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CTGATGCGTC GGATCATCCA GCAC3CAGCAG CAGCAGCAGC AGTC 
(2) INPORMATION FOR SEQ ZD NO:193: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOI/X3Y: unJcnown 

Jf f K'*^^^^^^'^ TYPE: CDMA 
Jill) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOORCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:193: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCA 
(2) INFORMATION FOR SEQ ID NO: 194: 

(1) SEQUENCE CHARACTERISTICS- 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: unJcnovm 

fff? .'^^^CULE TYPE: cDNA 
(111) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE- 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCAC 

(2) INFORMATION FOR SEQ ID NO: 195: 

^^^-^f^^JENCE CHARACTERISTICS: 

(A) LENGTHS 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEEWIESS: single 

(D) TOPOLOGY: unJcncwn 

(ii) MOLECULE TYPE: cDNA 
(lli) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRACSfENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 
^TGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACG 

(2) INFORMATION FOR SEQ ID NO:196: 
(i) SEQUENCE CHARACTERISTICS- 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOIiOGfY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04BNT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 
CTGATGOGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCAOGC 
(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAiGMENT TYPE: 

- (vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCT 
(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEMIBSS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) PBAGMBNT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 
CTCSATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA 
(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJoiown 

(ii) MOLECULE TYPE: cDNA 
(Iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE: 
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(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA A 
(2) INFORMATION FOR SEQ ID NO: 200: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sdLngle 

(D) TOPOLOGY: xinknown 

(ii) MOLECULE TYPE: cDNA 

(III) HYPOTHETICAL: NO 

(IV) ANTISENSE: NO 
(v) FRA^IENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA AC 
(2) INFORMATION FOR SEQ ID NO: 201: 

(1) SEQUE NCE CHARACTERISTICS- 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIHNT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 01: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACC 

(2) INFORMATIC»^ FOR SEQ ID NO:202: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unkno%m 

^.^il^^^OmjE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCG 

(2) INFORMATION FOR SEQ ID NO: 203: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) IiEMGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDKESS : single 

(D) TOPOIiOCT: unknown 

(ii) MOIiBCOLK TYPE; cDHA 
(ili) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
<V) FRAlG^IBNT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 
CTGATC3CGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGA 55 

(2) INFORMATION FOR SEQ ID NO: 204: 

(1) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNKSS : single 

(D) TOPOLOGY: unknown 

(il) »40LECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 
(xv) ANTISENSE: NO 

(v) FRAi^4BNT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:204: 
CTGATGOGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAA se 
(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAiC3<ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 
rTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAAT 57 

(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOMGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAOIENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 06: 

CTt^ATXKrOTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTXZACGCTA ACCGAATC SB 
(2) INFORMATION FOR SEQ ID NO: 207: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLiECULE TYPE: cDNA 
(ixi) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
<v) FRAOIENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

'^''^ <^rc^rccA <^oa^ 

(2) INFORMATION FOR SEQ ID NO:208: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 60 i>ase pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

<^ax^ <^«:,,^ ccA<x:;«^ 

(2) INFORMATION FOR SEQ ID NO:209: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

5ff?.**^^C01^ TYPE: CDNA 
(iix) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 

(v) FlUU^lENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:209: 
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CTGATCKXn.: GGATCATCCA OC;VGCAGCAG 

(2) INFORMATION FOR SEQ ID NO:210: 

(i> SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOIXXfY: imknown 

TYPE: CDMA 
(ili> HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE* 
(Vi) ORIGINAL SOORCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID n6:210: 
CTC».TGOGTC a<^«:;VTCa, CC;U^^ 

(2) INFORMATION FOR SEQ ID NO: 211: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: €3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: unknovm 

(ii) M0LECDL3 TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

<V) FRA04ENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTI<»J: SEQ ID NO:211: 
CTGAlXKXm: GGATCATXrCA G^ 

^3 

12) INFORMATICH7 FOR SEQ ID NO: 212: 

^^\^f^^^CE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

MOI^CULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) WRAGHEtn TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:212: 
^TX^X^TX: GGATCATCCA GCAGCAGCAG CAGCAG^ 

(2) INFORMATICS FOR SEQ ID NO:2l3: 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIOGY: unknown 

(ii) MOUSCUIiE TYPE: cONA 

(iii) HYPOTHETICAL: NO 
(xv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 13: 
J^OATGOtnr GGATCATCCA <^«^ 

65 

(2) INFORMATION FOR SEQ ID NO: 2 14: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnovm 

(ii) MOLECULE TYPE: cDNA 
Uli) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 
CTGATGCG 

66 

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS- 

Jo! ilSf^* pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sinale 

(D) TOPOLOGY: unknotm" 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

67 

(2) INFORMATION FOR SEQ ID NO:216: 

(i) SEQUENCE CHARACTERISTICS- 

\t\ ^« pairs ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



yfO98n0l66 



PCT/VS97/20444 



-305- 



(D) TOPOLOGY: unknown 

(iii) HYPOTHETICAL: NO 
(±y) ANTISKNSB: NO 

(V) frajc^iknt type- 

(vl) ORIGINAL SOORCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:216: 
SgJ^ <^T^TCCA GCAGCAGCAC 

(2) IKPORMATION FOR SEQ ID NO:217: 

^^Lf^H?*^^ CHARACTERISTICS: 
Im S9 base pairs 

nucleic acid 
/«? STRANDEDNKSS: single 
(D) TOPOI^Y: unknown 

JfH.**>^'BCDI^ TYPE: cDNA . . 

(xil) HYPOTHETICAI.: no 
(iy) ANTISBITSE: MO 
(V) FRA<9(BNT TYPE: 
(vi) ORIGIHAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217- 

<^'^'rcc^ oc.^<^^ 

(2) INFORMATION FOR SEQ ID NO:218: 

(i) SEQUENCE CHARACTERISTICS: 
S base pairs 

i^l nucleic acid 

/m STOANDEDNESS: single 
<D) TOPOLOGY: untoown 

'^^;?£DIfT3fPB: CDNA 
Jili) HYPOTHETlCALi NO 
(ly) ANTISENSE: NO 
jv) FRASMENT TYPE- 
(Vi) ORIOINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:218. 
SgJSSS «^TCATCCA <K:A0c^ 

(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

/o! 12?^* nucleic add 
JC) STRANDEDNESS: single 
<D) TOPOUXSY: unknown 

(if) MOLBCUI.E TYPE: cDNA 
»"-iJ HYPOTHETICAI,: NO 



PCTAJS97/2<M44 



-306- 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO:219: 
JJgATGCG^ 

(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQOENCE CHARACTERISTICS- 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: unknown 

(ii) MOLECDLE TYPE: cDNA 
(Hi) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOffiNT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTICW^: SEQ ID NO:220: 
S^^^TGC^ 

72 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 
TGCACCTGAC TCC 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS- 

(A) . LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(fix) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 

(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 
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TCCACCrrGAC TCCT 

14 

(2) INFORMATION FOR SBQ ID NO: 223: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECOLE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 
TGCACCTGAC TCCTG 

15 

(2) INFORMATION FOR SEQ ID NO:224: 

(1) SEQUKyCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDMA 
(iii) HYPOTHETICAL: NO 
(xv) ANTISENSE, NO 

(v) FRAiGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 24: 
TGCACCTGAC TCCTGT 

16 

(2) INFORMATION FOR SBQ ID NO:22S: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:225: 
TOCACCTGAC TCCTX3TG 

17 

(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: tmknown 

(^^^i^tjBCULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAQ4ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 
TGCACCTGAC TCCTGOXSG 

18 

(2) INFORMATION FOR SEQ ID NO: 227: 

(1) SEQUENCE CHARACTERISTICS- 
itl pairs 
(B) TYPE: nucleic acid 

(C STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

TYPE: CDNA 
(xil) HYPOTHETICAL: NO 
(iy) ANTISENSE: NO 
(V) FRA<34ENT TYPE: 
(vl) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:227: 
TGCACCTGAC TCCTGTOGA 

(2) INFORMATION FOR SEQ ID NO:228: 
^^L?^^^ CHARACTERISTICS- 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 

Jlii) HYPOTHETICAL: NO 
(±V) ANTISENSE: NO 

(v) FRA04ENT TYPE- 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:228: 
TGCACCTGAC TCCTGTGGAG 

(2) INFORMATION FOR SEQ ID NO: 229; 
(i) SEQUENCE CHARACTERISTICS - 

(B) TYPE: nucleic acid 



20 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)cnovm 

(ii) MOLECULE TYPE: cDNA 
(Jil) HYPOTHETICAL: NO 
(iy) ANTISENSE: NO 
(v) PRAOIENT TYPE- 
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(vi) ORIGINAIj SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID KO:229: 
TGCACCTGAC TCCTGTGGAG A 

(2) INFORMATION FOR SEQ ID NO: 230: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
<B) T3fPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTTSENSE: NO 

(v) FRA(3«ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 
TGCACCTGAC TCCTGTGGAG AA 

(2) INFORMATION FOR SEQ ID NO:231: 

(i) S3SQUK NCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) S^TOANDEDNESS : single 

(D) TOPOLOGY: unkno%m 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISEHSE: NO 

(v) FRAGHBHT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 
TGCACCTGAC TCCTGTGGAG AAG 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknom 

(ii) MOLECULE TYPE: cDNA 
(Iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:232: 
TGCACCTGAC TCCTGTGGAG AAGT 

(2) INFORMATION FOR SEQ ID NO:233: 
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(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: imknovm 

(ii) MOLECOLE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISEMSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 3: 
TGCACCTGAC TCCTGTGGAG AAGTC 

25 

(2) INFORMATION FOR SEQ ID NO:234: 

(1) SEQUENCE CHTUIACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEEttlESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
Uxi) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 
TGCACCTGAC TCCTGTGGAG AAGTCT 

26 

(2) INFORMATION FOR SEQ ID NO:235: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcno%m 

(ii) MOLECULE TYPE: cDNA 
Uii) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 
TGCACCTGAC TOCTGTGGAG AAGTCtG 

27 

(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 28 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unlcnown 

(ii) MOLECULE TYPE: cDNA 



wo 98/20166 



PCTAJS97/20444 



-311- 



(iii) HYPOTHETICAL: HO 

(iv) ANTISENSE: HO 
(V) FRACa^ENT TYPE; 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 
TGCACCTGAC TCCTC3TGGAG AAGTCTGC 

28 

(2) INFORMATION FOR SEQ ID NO:237: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: un3cnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237; 
TGCACCTGAC TCCTQTGGAG AAGTCTGCC 

29 

(2) INFORMATION FOR SEQ ID NO:23B: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(lli) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 
rOCACCTGAC TOCTGTGGAO AAGTCTOCCG 

30 

(2) INFORMATION FOR SEQ ID NO:239: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) liENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

^,^^1 ^^^-BXXJVE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 
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TGCACCTGAC TCCTX3TGGAG AAGTCTGCOG T 

31 

(2) INFORMATION FOR SEQ ID NO: 240: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEtttJESS : single 
(D) TOPOIjOGY: iinknown 

(ii) MOLECUIiE TYPE: cDNA 
Uii) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

TGCACCTGAC TCCTGTGGAG AAGTCTCCCG TT 

(2) INFORMATION FOR SEQ ID NO:241: 

<i) SEQUENCE CHARACTERISTICS: 
itl iS^^^' 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(fxi) HYPOTHETICAL: NO 

(iv) ANTISENSE: IK> 

(v) FRAGMENT TYPE- 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 
TGCACCTGAC TCCTGTGGAG AAGTCTCCCG TTA 

(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQU ENCE CHARACTERISTICS- 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

^^^^l^^^COLB TYPE: cDNA 
Uxi) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:242: 
TGCACCTGAC TCCTGTGGAG AAGTCTOCCG TTAC 

(2) INFORMATION FOR SEQ ID NO:243: 
(i) SEQUENCE CHARACTERISTICS- 
(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOIiECOLE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRACfffENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 
TCCACCTOAC TCCTGTGGAG AAGTCTGCOG TTACT 

(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: iinknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCOG TTACTG 

(2) INFORMATION FOR SEQ ID NO:245: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: tmknown 

(ii) MOLBCUIiE TYPE? cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 5: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCOG TTACTGC 

(2) INFORMATION FOR SEQ ID NO:246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unlmovm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: 1K5 

(v) FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:246: 
TGCACCTGAC TOCTGTGGAG AAGTCTOCCG TTACTC3CC 

(2) INFORMATION FOR SEQ ID NO:247: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 
Uii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
(V) FRAC3WENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTCCCC 
(2) INFORMATION FOR SEQ ID NO: 248: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(a-ix) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRACaiENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG rTTACTCCCCT 

(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEEttlESS : single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 

(v) FRAOfENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 
roCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTXSCCCT G 

(2) INFORMATION FOR SEQ ID NO: 250: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiCXJY: tmknown 

(ii) MOLECOIjE TYPE: cDMA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAIi SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GT 
(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs * 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: uhknom 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:251: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTG 
(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unlcnown 

(ii) MOLECULE TYPE: cDNA 
(Iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 
X5CACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTCG 

(2) INFORMATION. FOR SEQ ID NO:253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEJK5TH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unlcnown 

(ii) MOLECULE TYPE: cONA 
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Uix) HYPOTHETICAL: NO 
(XV) ANTISEKSE: NO 

(v) FRAOtBNT TYPE- 

(vi) ORIOIMRL SOORCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 
«3CACCTGAC TCCTGTGGAG AAGTCK3CCG TTACTCCCCT GTCGG 
(2) INFORMATION FOR SEQ ID NO: 254: 

^"^laf^SF^JE? CHARACTERISTICS: 
Jm iSS^- ■»« pairs 
(B TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJaiown 

Jff?.»*OI^C0LE TYPE: cDNA 

(III) HYPOTHETICAL: NO 

(IV) ANTISENSE: NO 

(V) FRAiGMBNT TYPE- 
(vi) ORIGINAL SOORCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:254. 

'^^CCT^AC TCCTGTGGAG AAGTCTOCCG TTACIX5CCCT G1XK5GG 
(2) INFORMATION FOR SEQ ID NO:25S: 

^^Lf^H?*^ CHARACTERISTICS: 

)°; TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknoim^ 

(ii) hKJLBCOLE TYPE: CDNA 
(111) HYPOTHETICAL: NO 
(iy) ANTISENSE: NO 
(V) FRAiCaiENT TYPE- 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 255- 
^CCX«AC TCCTGTGGAG AAGTXrxGCCG TTACXXJCCCT <,rOGOGC 
(2) INFORMATION FOR SEQ ID NO: 256: 

^^kf^SSJSl CHARACTERISTICS: 
itl i^^'- *8 base pairs 
(B TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown^ 

JfJ^i.'^CDLE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(ly) ANTISENSE: NO 
(v| FRAGMENT TYPE- 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:256: 
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TGCACCTGAC TCCTGTCCy^G AAGTCTCCCG TTACTX3CCCT GTCGGGCA 
(2) XNFORMATION FOR SEQ ID NO: 257: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOJjOGY: unJoiown 

5f f > MOI-BCDIiE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SODRCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:257: 
TGCACCTGAC TCCTGTGGAG AAGTCTCCCG TTACTX5CCCT GTGGGGCAA 
(2) INFORMATION FOR SEQ ID NO: 258: 

(1) SEQU ENCE CHARACTERISTICS: 

}fF^^' so pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

Jff?.**^^*®^^™^ TYPE: c£»IA 
<ili) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
^CCTGAC TCCTGTGGAG AAGTCTCCCG TTACTCCCCT GTX3GGGCAAG 

(2) INFORMATION FOR SEQ ID NO:259: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) <MtIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:259: 
-^CXrTGAC TCCTGTGGAG AAGTCTX3CCG TTACTCCCCT GXX5GGGCAAG G 

(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOIOGY: unknown 

(li) MOIiECOliE TYPE: cDNA 
(ili) HYPOTHETICAL: NO 

(iv) ANTISEMSE; NO 

(v) FRA04ENT TYPE; 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:260: 

^CCTGAC TCCXX3TGGAG AAGTC1X3CCG TTACTOOCCT GIGGGGCAAG GT 

(2) INFORMATION FOR SEQ ID NO:261: 

(i) SEQUENCE CHARACTERISTICS- 
ftl l^E^' 209 base pairs' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 



SS?^ S^gs^ ^^^^ 

-rOOTGCTGAG SSSS?!?? fS^^S^ AAGGTGAACG TCgI^SS? 12^ 

TAGAAACTGG GCAiaiS^ S^^^ AAGGTTACAA GACAGGTTTA AGGAgXS^S ^80 

209 

(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS- 

i^l J:^^' ^® pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

5f5L"^^^^^°^ TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

60 

88 

(2) INFORMATION FOR SEQ ID NO:263: 

^^^^^fEQUENCE CHARACTERISTICS- 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ll) MOIiECOLE TYPE: cDNA 
ail) HYPOIHETICAL: MO 
(Iv) ANTISKNSE: NO 
( V) FRA04ENT TYPE: 
(vl) ORIGINAL SOORCB: 

(Xl) SBQOENCE DESCRIPTION: SEQ ID NO:263: 

^STGTGTT aTTTTTTAAC AGGGATTTGG GGAATTATTT GAGA 
(2) INFORMATION FOR SEQ ID NO:264: 

^'^Lf^S^ CHARACTERISTICS: 
W I.ENGTH: 24 base pairs 
(B TYPE: nucleic aci^ 

STRANDEDNESS: single 
(D) TOPOLOGY: unkno^^ 

(ii) TOLECDLE TYPE: cDNA 

iXv) AHTISENSE: NO 
(V) FRAOIENT TYPB- 
(VI } ORIGINAL SOURCE: 

(Xi) SEQDEMCE DESCRIPTION: SEQ ID NO:2«. 
"'«»=AAAT CCCTC3TTAAA AAC 

(2) INFORMATION FOR SEQ ID NO:26S: 

^^(Af^^lSi CHARACTERISTICS: 
lAJ I^GTH: 26 base pairs 
(B) TYPE: nucleic acid 
Jnl ^PMTOEDNESS: single 
(D) TOPOLOGY: unkno%S 

J iii) HYPOTHETICAL: NO 
(ly) ANnSENSE: NO 
JV) FRAfiMKMT TYPE: 
(vi) QRIGIKAL S<XIRCB: 

(XI) SEQDENCE DESCRIPTION: SEQ ID NO:265. 
"T^^CCAAAT CCCTCTTAAA AAAAC 

(2) INFORMATION FOR SEQ ID NO:266: 

^^kf^^S^ CHARACTERISTICS: 
Si 27 base pairs 

H?^- nucleic acid 
fTRANDEDNESS: sinale 
(D) TOPOLOGY: unfcnov^^^ 

(ill) HYPOTHETICAL: MO 
(ly) ANTISEMSE: NO 

(V) FRA04BNT TYPE: 

(VI) ORIGINAL SOORCE- 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
TTCCCCAAAT CCCTGTTAAA AAAAAAC 

27 

(2) INFORMATION FOR SEQ ID NO: 267: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOI/>3Y: xinknown 

^.^P, ^^Ol^ECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
<v) FRAOIENT TYPE: 

<vi) ORIGINAL SOURCE: " 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:267: 

S5™ " 

103 

(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: tinknown 

(ii) MOLECULE TYPE: cDNA 
(111) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAC^fBNT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO :26a: 

CAAAOU^ CCATGGT.^ CCX^CTCCT OACGAaKCTO eo 

78 

(2) INFORMATION FOR SEQ ID NO: 269: 

^^^^^f^^JENCE CHARACTERISTICS: 
;»l 5^^' '^^ *^se pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)cnown 

(ii) MOLECULE TYPE: cDNA 
(Hi) HYPOTHETICAL- NO 

(iv) ANTISENSE: NO 

(v) FRAOIENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:269: 
^•C^^^TGTXrrC CACATGCCCA GNCTCCTCAG GACTCAGGTG CACAOGGOGT Cnn^CG 60 
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CATGGCGCAC CTOAGCTC 

(2) IHTORMATION FOR SBQ ID NO:270: 

In £™ANDEDlJESSi single 
(D) TOPOLOGY: unknown 

J'^y A»T1SKNSE: MO 
(V) FKAOfEMT TYPE- 
(Vi) ORlanou, SOURCE: 

^^^^^^^^) SEQOEMCE DESCRIPTION: SEQ ID KO:270: 

(2) INTORMAXION FOR seQ ID NO:271. ''^ 

m ^^^=DNESS: single 
(D) TOPOIOOY: unkno^m 

5ff|,*^2J^£|?^ TYPE: cDNA 

HYPOTHETICAL- NO 
(xy) ANTISENSE: TO 
(V) FRAGMENT TYPE^ 
(va) ORIGINAI. SOORCB: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:271: 

^^^^^ SSSS^ S'^'^^^ 

(2) INFORMATION FOR SBQ ID NO:272: 
^-Sef^^lcft'" 

Civ) ANTISBNSB: NO 
Jv) FRAaiBNT TYPB- 
(VI) ORIGINAI. SOO^CE: 

<Xi) SEQOENCE DESCRIPTION: SEQ ID NO-272. 

^^^^^^ CACAXGcocA gnc.cc«:ag cao^:;.^ ec ' 

(2) INFORMATION K)R SEQ ID NO:273: " 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS : single 

(D) TOPOLOGY: xinkncwn 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA01ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:273: 
CACCTGACTC CTA 

(2) INFORMATION FOR SEQ ID NO:274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(xxi) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAiaffiNT TYPE: 

(vi) ORIGINTIL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 74: 
CACCTGACTC CTGGA 

/^A 14 
12} INFORMATION FOR SEQ ID NO:275: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 14 base pairs 
(B TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

Jff> WO^CULE TYPE: cDNA 
(111) HYPOTHETICAL: NO 
Uv) ANTISENSE: NO 

(v) FR7VGMBNT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 75: 
CACCTGACTC CTGA 

/I 14 
(2) INFORMATION FOR SEQ ID NO:276: 

(i) SEQUENCE CHARACTERISTICS- 
(A LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(Hi) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 

(v) FRA^IENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 76: 
CCATGGTGTC TGTTTGAGGC ATCGOG 

26 

(2) INFORMATION FOR SEQ ID NO: 2 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

SSS^^'''^* ^^^^'^ 60 

75 

(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 61 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAlGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

ACCTAGCGTT CAGTTCGACT GAGATAATAC GACTCACTAT AGCAGCTCTC ATPTTCCATA 60 

61 

(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) IjENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: imJcnown 

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOfENT TYPE: 

(vi) <MIIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 
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GUCACOACAG GOGAGCacCA 

(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRACMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 
CTCAGTCCAC GTGGTACCCT GCTG 

24 

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUE NCE CHARACTERISTICS- 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unkno%m 

^,^^\^^^CUL^ TYPE: cDNA 
(111) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

S^S?^ gSS^^^ GCAACCTCAA ACAOAC^^ccA ^tX^G^cc^ 

85 

(2) INFORMATION FOR SEQ ID NO:282: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 
.CX3GGTCCCG GAGTGGTGTC GC 

22 

(2) INFORMATION FOR SEQ ID NO 283- 
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(i) SEQUE NCE CHftRACTERISTICS : 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : single 

(D) TOPOLOGfy: unknown 

(ii) MOr^COIiE TYPE: tDHOi 

(iii) HyPOTHETICSUi: NO 

(iv) ANTISENSE: NO 

(v) FRAGMEtrr TYPE: 
<vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

^S^SSS ?J^S^ ^^'^'^^ .0 

(2) INFORMATION FOR SEQ ID NO:284: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENOTH: 76 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cIMlA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISBNSE: NO 

(v) FRAlCaiENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:284: 

SSS^ ;jgS^ ^'^'^'^ eo 

76 

(2) INFORMATION FOR SEQ ID NO:285: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:285: 

^SSS^ ^'^'"^^ «o 

76 

(2) INFORMATION FOR SEQ ID NO:2e6: 

(i) SEQUE NCE CHARACTERISTICS- 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: unknown 

(ii) MOLECDIiE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:28€: 

SSS^^ ^^^^^^^ ,0 

(2) INFORMATION FOR SEQ ID NO:287: 

^^^.^fEQDENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
Uli) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 
ACAGGTTTAA GGAGACCAAT AGAAACOXSGG CATCTGGAGA CAGAGAAG 
(2) INFORMATION FOR SEQ ID NO:288: 

(i) SEQUENCE CHARACTERISTICS- 

/o 5:^™= 2^ pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

^^P.^^^COl^ TYPE: CIMIA 
(iix) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) PRACEMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 
GACGACGACT GCTACCTGAC TCCA 

24 

(2) INFORMATION FOR SEQ ID NO:289: 

^^L?^^^^^ CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
;C) STRANDEDNESS: single 
(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: cDNA 
Uxi) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
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(vi) ORIGIHAL SOURCE: 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 
ACAGOGCACT GCTACCTGAC TCCA 

24 

<2) INFORMATION FOR SEQ ID NO:290: 

(i) SEQUENCE CHARACTERISTICS- 

(A) I.ENOTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: unJcnovm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<V> FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: . - 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 90: 
^^^^^^^^'^^^TCAGG TAGCAGTC 

/I 18 
(2) INFORMATION FOR SEQ ID NO:291: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
Uii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE: 
(vi.) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 
^A<K:TCXCAT TTTCCATACA GXCACT 

(2) INFORMATION FOR SEQ ID NO:292: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)cno%m 

5ffK"^^<:^U^ TYPE: CDNA 
liii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 
gXCATCTTG GGGCTGTCGA GAGTAAAAGG TATCTCAGIXT ATAGTTAAGA CCXTCITAAA 

6S 
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(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SBQDENCE CHARACTERISTICS- 
rni 25 base pairs ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

Jff?.'*OI^CDl^ TYPE: cDNA 
Jill) HYPOTHETICAI,: NO 
(iy) ANTISENSE: NO 
(V) FRAGMENT TYPE- 
(Vi) ORIGINAL SOORCE: 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 
GTAATTTCTA TCAGTAGAAC CCCQA 

(2) INFORMATION FOR SEQ ID NO: 294: 

CHARACTERISTICS: 

J» i^^'- «0 pairs 

(B TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: unfcnovm 

(111) HYPOTHETICAL: NO 
(iy) ANTISENSE: NO 
("w) FRAGMENT TYPE- 
(vi) ORIGINAL SOORCE: 

(Xi) SEQDENCE DESCRIPTION: SEQ ID NO -294- 
«OCTC^. ^^^^ 

(2) INFORMATION FOR SEQ ID NO:29S: 

^"^kf^PSSJ^ CHARACTERISTICS: 
(A) LENGTH: IS base oairs 

(c S^f^""*=l«i«' 

£P»»»DEIMIBSS: Single 
(D) TOPOLOGY: unknown 

M^-^'^Sif??^ WPE: CDNA 
Clii) HYPOIBETICAL: NO 
(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE- 
(vi) ORIGINAL SOURCE: 

(Xi) SEQDENCE DESCRIPTION: SEQ ID NO:29S: 
GTCATCTTG GGGCT 

<2) INFORMATION FOR SEQ ID NO:296: " 

^'^/»f^2?*'*^ CHARACTERISTICS - 
Jnl iSf^= pairs 
J? 3^^= nucleic acid 
;«? STRANDEMresS: single 
(D) TOPOLOGY: unfcnoim 



wo 98/20166 



PCT/US97/20444 



-329- 



(ii) MOIiECOLE TYPE: cDHA 

(iii) HYPOTHETICAL: NO 

(iv) AMTXSEKSE: NO 

(v) FRAi^CENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xl) SEQUENCE DESCRIPTIC^: SBQ ID NO:296: 
CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAQAT 60 
(2) INFORMATION FOR SEQ ID NO: 297: 

<i) SEQOENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 97: 
V5TCATCTTG GGGCTA 

16 

(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SS QUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 
AGCTCTCAT TTTCCATACA TTAAAGATAG TCATCITCGG OCT 

43 

(2) INFORMATION FOR SBQ ID NO:299: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:299: 
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CAGCTCTCAT TTTCCATACA TTAAAGATAG TCATCTTGGG GCTA 

44 

(2) INFORMATION FOR SEQ ID NO: 300: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: cDNA 

(iii) EtYPOTHETICAL : NO 

(iv) ANTISENSB: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 
CAGCTCTCAT TTTCCATACA GT 

- . 22 

(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 01: 
CAGCTCTCAT TTTCCATACA T 

21 

(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUE NCE CHARACTERISTICS- 

(A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(ixi) HYPOTHETICAL: HO 
(IV) ANTISENSE: NO 

(v) FRAG^fENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 
GCCTGGTACA CTGCCAGGCG CTTCTGCAGG TCATOGGCAT CGCGGAGGAG 5^ 

(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
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z^ni STRANDEDNBSS: single 
(D) TOPOLOOT: unknown 

liVx*^'^'^'^ TYPE: CDNA 
<fll) HYPOTHETICAL: NO 
<iy) ANTISENSB: NO 
(V) FliACMBHT TYPE: 
ivi) ORIGINAL SODRCE: 

(Xlj SEQUENCE DESCRIPTION: SBQ It> NO: 303- 
««^^««^ CrecCAGGCA CTTCT^CAOG rc^TCOC^r COCGGACQAG 
(2) INFORMATION FOR SEQ ID NO: 304: 
^"^iaf^^lS CHARACTERISTICS: 

frl • '^"cleic acid 

CC) STRANDEDNESS: single 
(D) TOPOLOGY: unknowS^ 

Jff?.'*°^COLE TYPE: cDNA 
Jfil) HYPOTHETICAL: NO 

(IV) AHTISENSE: NO 

(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOORCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:304- 
<!ATaCOaATG ACCTOCAGAA G 

(2) INFORMATION FOR SEQ ID NO:30S: 

*^}af^?2S!i^ CHAliACTERISTICS: 
(A) I^«3TH: 22 base pairs 
<B) TYPE: nucleic acid 
CC) STRANDEDNESS: single 
(D> TOPOLOGY-: unknown 

fllli'^^f22!£_^== COMA 
JJif^.SJ^^WHETICAL: NO 
(Iv) ANTISENSE: NO 
JV) FRA01ENT TYPE: 
(Vi) ORIGINAL SODRCE: 

(Xi) SEQDBNCE DESCRIPTION: SEQ ID NO:305. 
3ATGCOGATG AOCTGCAGAA GC 

(2) INFORMATION FOR SEQ ID NO:30fi: 

^^l-f^^CE CHARACTERISTICS- 
5o! £S?^= 24 base pairs ■ 
inl I^^' °"<=leic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown^ 

Cil) MOLECDLE TYPE: cDNA 
JJii) HYPOTHETICAL: NO 
(Iv) AHTISENSE: NO 
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(v) FRAOCENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 
CSATGCCGATG ACCTX3CAGAA GTGC 

<2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS' 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: iinknown 

^^OZiECOhE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
Uv) ANTISENSE: NO 

(v) FRAOilENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 
GATGATCCGA CG 

12 

(2) INFORMATION FOR SEQ ID NO: 308: 

<^\SEQUBiiJCB CHARACTERISTICS: 

(B) TYPE: nucleic acid 
(C STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
Uli) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAGMENT TYPE- 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:308: 
CTGATGCGTC GGATCATC 

18 

, (2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
Uli) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAG»iENT TYPE- 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:309: 
3ATGATCOGA CG 

12 
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(2) INFORMATION FOR SEQ ID NO: 310: 



i it) SEQUENCE CHARACTERISTICS- 

I (A) LENGTH: 35 base pairs 

I (B) TYPE: nucleic acid 

I (C) STRANDEDNBSS : single 

I (D) TOPOLOGY: unknovm 

1 (ii) MOLECULE TYPE: CDNA 

<iii> HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAIG94ENT TYPE: 

(vi) ORIGINAL SOCFRCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 
<3GCGCQGACA TGGAGGAOGT GTGCGGCCGC CTGGT 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUE NCE CHARACTERISTICS: 
(A) LENGIH: 34 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: tinknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAOCENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 
rCCGCGATGC CGATGACCTC CAGAAGOGCC TCGC 

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS- 
/o 27 base pairs* 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(ili) HYPOTHETICAL: NO 
(ly) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 
^SGCTGCGAT CACCGTGCGG CACAGCT 

(2) INFORMATION FOR SEQ ID NO:313: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
JB) TYPE: nucleic acid 
(C) STRANDEDNBSS: single 
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(D) TOPOUX3Y: unknown 

I iff? MOLECOLE TYPE: cDNA 

^ iXii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3l3: 
OGGCTGOGAT CACCGTGOGG T 

21 

(2) INFORMATION FOR SEQ ID NO: 314: 
(i) SEQXJENCE CHARACTERISTICS: 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

{f^^?,'*^^COLE TYPE: cDNA 
Uii) HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAGMENT TYPE* 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 
<^QG<^TGCXiAT CACCGTCCGG AACAGCT 

(2) INFORMATION yOR SEQ ID NO: 315: 
(i) SEQUENCE CHARACTERISTICS- 

(B TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)aiown 

TYPE: CDNA 
Jia.1) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 
(V) FRA04ENT TYPE: 
(vx) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:315: 
^^^^^^^^I'GCGAT CACCGTGOGG CA 

22 

(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS- 
inl 22 base pairs' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

5ff? TYPE: CDNA 

Jill-) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA04ENT TYPE- 

(vi) ORIGINAL SOURCE - 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 
CQGCTGCGAT CACOGfTXSOGG TA 

<2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDKDNESS : single 

(D) TOPOIiOGY: un3aio%m 

(ii) MOIiECUIiE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 
OGGCTGCGAT CACCGTGOGG A 

(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEZ»IESS : single 

(D) TOPOLOGY: xinknovm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 
ATCATCAACT GGAAGATCAG GTCAGQAGCC ACTTOCCANC CT 
(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: iinJcnown 

<ii) MOLECULE TYPE: cDNA 
Uii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 
ATCATCACAC TGGAAGACTC CAGGTCAGGA GCC 

(2) INFORMATION FOR SEQ ID NO: 320: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: iinkno%m 

(ii) ^^oLEca^LE type: cdna 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:320: 
ATCCACTACA ACTACATGTG TAACAGTTCG wGCwwGCC 
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WHAT IS CLAIMED IS: 

1- A process for determining the sequence of a target nucleic acid 
molecule comprising the steps of: 

a) generating at least two nucleic acid fragments from the 
5 target nucleic acid; and 

b) analyzing the at least two fragments by a mass 
spectrometry format, and thereby determine the sequence 

of the target nucleic acid molecule. 

2. A process of claim 1, wherein in step a), an endonuclease is 
10 contacted with the target nucleic acid to generate the at least two 

nucleic acid fragments. 

3. A process of claim 2. wherein the endonuclease is a restriction 
enzyme that can recognize and cleave at least one restriction site in the 
target nucleic acid. 

4. A process of claim 2, wherein the target nucleic acid is a 
deoxyribonucleic acid and the nuclease is a deoxyribonuclease. 

5. A process of claim 2, wherein the target nucleic acid is a 
ribonucleic acid and the nuclease is a ribonuclease. 

6. A process of claim 5, wherein the ribonuclease is selected from 
20 the group consisting of: the G-specific T, ribonuclease, the A-specific 

ribonuclease, the A/U specific PhyM ribonuclease, the U/C specific 
ribonuclease A, the C-specific chicken liver ribonuclease and crisavitin. 

7. A process of claim 1, wherein in step a), nucleic acid 
fragments are generated by performance of a combined amplification and 

25 base-specific termination reaction. 

8. A process of claim 7, wherein the combined amplification and 
base-specific termination reaction is performed using a first polymerase, 
which has a relatively low affinity towards at least one chain terminating 
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nucleotide and an a second polymerase, which has a relatively high 
affinity towards at least one chain terminating nucleotide. 

9. A process of claim 8, wherein the first and second polymerases 
are thermostable DNA polymerases. 

10. A process of claim 9, wherein the thermostable DNA 
polymerases are selected from the group consisting of: Taq DNA 
polymerase, AmpliTaq FS DNA polymerase. Deep Vent (exo-) DNA 
polymerase. Vent DNA polymerase. Vent (exo ) DNA polymerase Vent 
DNA polymerase. Vent (exo") DNA polymerase. Deep Vent DNA ' 
polymerase, Thermo Sequenase, exo(-) Pseu<fococcus furiosus KPfu) DNA 
polymerase, AmpliTaq, Ultman, 9 degree Nm, Tth, Hot Tub, Pyroaoaous 
funosus iPfu) and Pyrococcus woesei {Pv^o) DNA polymerase 

1 1 . A process of claim 1 . wherein the at least two nucleic acid 
fragments generated in step a) include mass modified nucleotides 

12. A process of claim 1, wherein the at least two fragments 
comprise a 3' tag. 

13. A process of claim 1, wherein the at least two fragments 
comprise a 5' tag. 

20 tag. a non-natural 

1 5. A process of claim 14, wherein the non-natural tag is selected 
from the group consisting of: an affinity tag and a mass marker. 

1 6. A process of claim 1 5, wherein the affinity tag facilitates 
immobilization of the nucleic acid to a solid support. 

25 1 7. A process of claim 1 6, wherein the affinity tag is biotin or a 

nucle.c acid sequence that is capable of binding to a capture nucleic acid . 
sequenc that is bound to a solid support. 



15 
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18, A process of claim 1 , wherein the process additionally 
comprises the step of: ordering the at least two nucleic acid fragments to 
determine the sequence of the target nucleic acid, 

1 9, A process for detecting a target nucleic acid present in a 
5 biological sample, comprising the steps of: 

a) performing on a nucleic acid obtained from a biological sample; 
a first polymerase chain reaction using a first set of primers, which 
are capable of amplifying a portion of the nucleid acid containing 
the target nucleic acid, thereby producing a first amplification 

10 product; and 

b) detecting the first amplification product by mass spectrometry, 
wherein detection of the target nucleic acid indicates that the 
target nucleic acid is present in the biological sample, 

20. A process of claim 19, wherein prior to step b), a second 

15 polymerase chain reaction is performed on the first amplification product 
using a second set of primers, which are capable of amplifying at least a 
portion of the first amplification product, which contains the target 
nucleic acid. 

21. A process of claim 19 or 20, wherein prior to step b), the 
20 target nucleic acid is immobilized to a solid support. 

22. A process of claim 21, wherein the target nucleic acid is 
reversibly immobilized. 

23. A process of claim 22, wherein the target nucleic acid can be 
cleaved from the solid support by a chemical, enzymatic or physical 

25 process. 

24. A process of claim 23, wherein immobilization is 
accomplished via a photocleavable bond. 

25. A process of claim 22, wherein the target nucleic acid is 
cleaved from the support during step b). 
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26. A process of claim 21 , wherein the solid support is selected 
from the group consisting of: beads, flat surfaces, chips, capillaries pins 
combs and wafers. ' ^ ' 

27. A process of claim 21, wherein immobilization is 

5 —mplished by hybridization between a complementary capture nucleic 
acd molecule immobilized to a solid support, and a portion of the nucleic 
acd molecule, which is distinct from the target nucleic acid sequence 

28. A process of claim 19 or 20, wherein prior-to step b), the 
target nucleic acid is purified. 

10 29. A process of claim 1 9 or 20, wherein the primer or first or 

second amplification product is conditioned. 

30. A process of claim 29, wherein the primer or first or second 
amplification product is conditioned by phosphodiester backbone 
modification, 

15 31. A process of claim 30, wherein the phosphodiester backbone 

modification is a cation exchange. 

32. A process of claim 29, wherein the primer or first or second 
amplification product is conditioned by contact with an alkylating agent 
or trialkylsilyl chloride. 

33. A process of claim 29, wherein conditioning Is effected by 
.ncluding at least one nucleotide that reduces sensitivity for depurination 
<n the pnmer or first or second amplification product. 

34. A process of claim 33, wherein the nucleotide is an N7- or 
N9. deazapurine nucleotide or 2' fluoro 2' deoxy nucleotide. 

35. A method for detecting neoplasia/malagnancies in a tissue 
or cell sample, comprising detecting telomerase activity, mutation of a 
proto-oncogene, expression of a tumor specific gene in the sample by 
detecting nucleic acids that encode the telomerase, that are specific for 
the mutation or that encode the tumor-specific by mass spectometry 
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36. The method of claim 35 that is a method for detecting 
neoplasia/malagnancies In a tissue or cell sample, comprising: 

a) isolating telomerase from the sample and adding a 
synthetic DNA primer, which is optionally 
immobilized, complementary to a telomeric repeat, 
and all four deoxynucleotide triphosphates under 
conditions that result in telomerase specific extension 
of the synthetic DNA; 

b) amplifying the telomerase extended DNA product; and 



c) detecting the DNA product by mass spectrometry, 

wherein telomerase-specific extension is indicative of 
neoplaisa/malignancy. 

37. The method of claim 36, wherein the primer contains a 

1 5 linker moiety for immobilization on a support; and the amplified primers 
are isolated conjugating the linker portion to a solid support. 

38. The method of claim 35 that is a method for identifying 
transformed cells or tissues, comprising: 

a) in a cell or tissue sample, amplifying a portion of a 
Proto-oncogene that includes a codon indicative of 
transformation, wherein one prinier comprises a linker 
moiety for immobilization; 
c) immobilizing DNA via the linker moiety to a solid 
support, optionally in the form of an array; 
^® hybridizing a primer complementary to the proto 

oncogene sequence that is upstream from the codon 
e) adding 3dNTPs/1 ddNTP and DNA polymerase and 
extending the hybridized primer to the next ddNTP 
location; 
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f) ionizing/volatizing the sample; and 

g) detecting the mass of the extended DMA, whereby 
mass Indicates the presence of wild-type or mutant 
alleles. The presence of a mutant allele at the codon 
is diagnostic for neoplasia. 

38. Wherein meproto-oncogene is .he 
RET-proto-oncogene. 

40. The method of claim 35 that Is a method for detecting 
expression of a tumor-specific gene, comprislng: 

Isolating polyA RNA from the sample- 

c) preparing a cDNA library using reverse transcription- 

d) amplifing a cONA product, or portion thereof, of the 
tumor-specific gene, wherein one oligo primer 
comprises a linker moiety; 

^•"P«««<^ Product by immobilizing the 
DNA to a solid support via the linker moiety; 
f) optionally conditioning the DNA: 
of » nM. ^' '""i^ing/volatizing sample and detecting *e presence 
= DNA peak tf,a. is indicative of expression of the gene 

eel, 1'' bone maao 

cells, the gene is «,e tyrosine hydroxylase gene, and expression of the 
gene is indicative of neuroblastoma. 

42. A method for directly detecting a double-stranded nucleic 

26 CTTOFr*"°"'"°' "'-<'=?°^«°"'--i-on <MALO,,-«me.,f- 
night (TOF) mass spectrometry, comprislng: 

a) Isolating a double-stranded DNA fragment from a cell or 

tissue sample; 
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b) preparing the double-stranded DNA for analysis under 
conditions that Increase the ratio of dsDNArssDNA, wherein 
the conditions include one or all of the following: preparing 
samples for analysis at reduced temperatures (i.e. 4 C) 

^ and using of higher DNA concentrations in the matrix to 

drive duplex formation; 

c) ionizing/volatizing the sample of step b), wherein low 
acceleration voltage of the ions are used; - 

d) detecting the presence of the double-straned DNA, 
10 43. A method for comparing DNA samples to discern 

relatedness or to detect mutations, comprising: 

a) obtaining biological a plurality of samples; 

b) amplifying a region of DNA from each sample that contains 
two or more microsatellite DNA repeat sequences; 

IS c) ionizing/volatizing the amplified DNA; 

d) detecting the presence of the amplified DNA and comparing 
the molecular weight of the amplified DNA, wherein 
different sizes are indicative of non-identity between or 
among the samples. 
20 44, The method of claim 43, wherein non-identity is indicative 

of the presence of a mutuation in the DNA in one sample, non- 
relatedness or non-HLA compatibility between or among the individuals 
from whom the samples were obtained. 

45. The method of claim 43 or 44, wherein a plurality of 
25 markers are examined simultaneoulsy. 



wo 98/2016ti 



PCT/US97/20444 



-344- 



10 



46. A method for detecting a target nucleic acid in a sample, 
comprising: 

a) amplifying a target nucleic acid sequence using; 
(i) a first primer, wherein: 

the 5'-end shares identity to a portion of the target 
DNA immediately downstream from the targeted codon 
followed by a sequence that introduces a unique restriction 
endonuclease site, and 

the 3'-end primer is self-complementary; and 
(ii) a second downstream primer that contains a tag; 

b) immobilizing the double-stranded amplified DNA to a solid 
support via the linker moiety; 

c) denaturing the immobilized DNA and isolating the non- 
immobilized DNA strand; 

1 5 d) annealing the intracomplementary sequences in the 3'-end 

of the isolated non-immobilzed DNA strand, such that the 
3'-end is extendable by a polymerase; 
f) extending the annealed DNA by adding DNA polymerase, 
3 dNTPs/1 ddNTP; 

20 g) cleaving the extended double stranded stem loop DNA with 

the unique restriction endonuclease and removing the 
cleaved stem loop DNA; 
i) ionizing/volatizing the extended product; and 
i) detecting the presence of the extended target nucleic acid, 
whereby the presence of a DNA fragment of a mass 
different from wild-type is indicative of a mutation at the 
targ t codon(s). 
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47. A method for detecting a target nucleic acid in a biological 
sample using RNA amplification, comprising: 

amplifying the target nucleic acid using a primer comprising a 
region complementary to the target sequence and a region that encodes 
5 a promoter; 

synthesizing RNA using an RNA polymerase the recognizes the 
promoter; 

detecting the resulting RNA using mass spectrometry. 

48. A primers for mass spectrometric analyses, comprising all or 
10 at least about 20, preferably about 16, bases of any of the sequence of 

nucleotides sequences set forth in SEQ ID NOs. 1-22, 24, 27-38 41-86 
89, 92, 95, 98, 101-110, 112-123, 126, 128 and 129, wherein the 
primer is unlabled. 

49. The primers of claim 48, further comprising a mass 
15 modifying moiety. 

50. A process for detecting a target nucleic acid sequence 
present in a biological sample, comprising the steps of: 

a) obtaining a nucleic acid molecule containing a target ncleic 
acid sequence from a biological sample; 

immobilizing the target sequence on the support via thiol 
linkages, whereby the target is present at a sufficient density to detect it 
using mass spectrometry; 

c) hybridizing a detector oligonucleotide with the target nucleic 
acid sequence; 

d) removing unhybridized detector oligonucleotide; 

e) ionizing and volatizing the product of step c); and 

f) detecting the detector oligonucleotide by mass spectrometry, 
wherein detection of th detector oligonucleotide indicates the presence 
of the target nucleic acid sequence in the biological sample. 
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51 . The process of claim 50, wherein the target nucleic acid 
molecule is amplified prior to immobilization. 

52. The process of claim 50 or 52, wherein at least one of the 
detector oligonucleotide or the target nucleic acid sequence has been 

5 conditioned. 

53. A process of any of claims 50-52, wherein the solid support is 
selected from the group consisting of: beads, flat surfaces, pins and 

combs, 

54. A process of any of claims 50-53. wherein target nucleic 
10 acid IS immobilized in the form of an array. 

55. A process of any of claims 50-54, wherein the support is a 
silicon wafer. 

56. A process of any of claims 51-55, wherein the target 
nucleic acid moelcule is amplified by an amplification procedure selected 
from the group consisting of cloning, transcription, the polymerase chain 
reaction (PGR), the ligase chain reaction (LCR), and strand displacement 
amplification (SDA). 

57. A process of any of claims 50-56, wherein the mass 
spectrometer is selected from the group consisting of: Matrix-Assisted 
Laser Desorption/lonization Time-of-Flight (MALDI-TOF), Electrospray 
(ES), Ion Cyclotron Resonance (ICR), and Fourier Transform. 

58. A process of any of claims 50-57, wherein the sample is 
conditioned by mass differentiating at least two detector oligonucleotides 
or oligonucleotide mimetics to detect and distinguish at least two target 

25 nucleic acid sequences simultaneously. 

59. A process of claim 58, wherein the mass differentiation is 
achieved by differences in the length or sequence of the at least two 
oligonucleotides. 
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60. A process of claim 59, wherein the mass differentiation is 
acheived by the Introduction of mass modifying functionalities in the 
base, sugar or phosphate moiety of the detector oligonucleotides. 

61. A process of claim 58, wherein the mass differentiation is 
5 achieved by exchange of cations at the phosphodiester bond. 

62. A process of any of claims 50-61 , wherein the nucleic acid 
molecule obtained from a biological sample Is amplified into DNA using 
mass modified dideoxynucleoside triphosphates and DNA dependent 
DNA polymerase prior to mass spectrometric detection. 

10 63. A process of any of claims 50-62, wherein the nucleic acid 

molecule obtained from a biological sample is amplified into RNA using 
mass modified ribonucleoside triphosphates and DNA dependent RNA 
polymerase prior to mass spectrometric detection. 

64. A process of any of claims 50-63, herein the target nucleic 
15 acid sequence is indicative of a disease or condition selected from the 

group consisting of a genetic disease, a chromosomal abnormality, a 
genetic predisposition, a viral infection, a fungal infection and a bacterial 
infection. 

65. A method of determining a sequence of a nucleic acid, 
20 comprising the steps of: 

(I) obtaining multiple copies of the nucleic acid to be sequenced: 

(II) cleaving the multiple copies from a first end to a second end 
with an exonuclease to sequentially release individual nucleotides; 

(iii) identifying each of the sequentially released nucleotides by 
25 mass spectrometry; and 

(iv) determining the sequence of the nucleic acid from the 
identified nucleotides, wherein the nucleic acid is immobilized by covalent 
attachment to a solid support via at least one sulfur atom. 



wo 98/20166 



PCTAJS97/20444 



-34S- 



66. A method of determining a sequence of a nucleic acid, 
comprising the steps of: 

(0 obtaining multiple copies of the nucleic acid to be sequenced- 

(ii) cleaving the multiple copies from a first end to a second end' 
5 wrth an exonuclease to produce multiple sets of nested nucleic acid 

fragments; 

(iii) detem,ining the molecular weight value of each one of the sets 
of nucleic acid fragments by mass spectrometry; and 

(iv) determining the sequence of the nucleic acid from the 

10 molecular weight values of the sets of nucleic acid fragments, wherein 
the nucleic acid is immobilized by covalent attachment to a solid support 
via at least one sulfur atom. 

67. The process of claim 65 or 66, wherein the nucleic acids are 
covalently bound to a surface of the support at a density of at least 20 

lo fmol/mm^. 

68. The method of any of claims 50-67, wherein immobilization 
IS effected by a method comprising: 

reacting a thiol-containing insoluble support with a nucleic acid 
: comprising a thiol-reactive group under conditions such that a covalent 
bond is formed; 

thereby immobilizing the nucleic acid on the insoluble support. 

69. The method of claim 68, further including the step of 
modifying the insoluble support with a thiol-containing reagent, to form a 
thiol-containing insoluble support. 
25 70. The method of claim 68 or 69. wherein the thiol-reactive 

cross-linking reagent is N-succinimidyl (4-iodoacetyl) aminobenzoate 
(SIAB). 

71 . The method of claim 65 or claim 66, wherein the nucleic 
acid is a 2'-deoxyribonucleic acid (DNA). 
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72. The method of claim 65 or claim 66, wherein the nucleic 
acid is a ribonucleic acid (RN A). 

73. The method of any of claims 65-71, wherein the 
exonuclease is selected from the group consisting of snake venom 
phosphodiesterase, spleen phosphodiesterase, BaI-31 nuclease, E. coli 
exonuclease I, E. coli exonuclease VII, Mung Bean Nuclease, SI 
Nuclease, an exonuclease activity of soH DNA polymerase 1, an 
exonuclease activity of a Klenow fragment of DNA polymerase 1, an 
exonuclease activity of T4 DNA polymerase, an exonuclease activity of 
T7 DNA polymerase, an exonuclease activity of Taq DNA polymerase an 
exonuclease activity of DEEP VENT DNA polymerase, £o!i exonuclease 
III, lambda exonuclease and an exonuclease activity of VENT„DNA 
polymerase. 

74. The method of any of claims 65-74, wherein the nucleic acid 
15 comprises mass-modified nucleotides. 

75. The method of claim 74, wherein the mass-modified 
nucleotides modulate the rate of the exonuclease activity. 

76. The method of claim 74. wherein the sequentially released 
nucleotides are mass-modified subsequent to exonuclease release and 

20 prior to mass spectrometric identification. 

77. The method of claim 76, wherein the sequentially released 
nucleotides are mass-modified by comact with an alkaline phosphatase. 

78. A method of any of claims 65-77, wherein the mass 
spectrometry format is matrix assisted laser desorption (MALDI) mass 

25 spectrometry or electrospray (ES) mass spectrometry. 

79. A method of any of claims 65-79, wherein immobilization is 
ffected by a method, comprising: 
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reacting the surface of the substrate with a soiution of 3 
aminopropyltriethoxysilane to produce a uniform layer of primary amines 
on the surface of the substrate; and 

derivatizing the surface of a substrate with iodoacetamindo 
5 functionalities by reacting the uniform layer of primary amines with a 
solution of N-succinimidyl (4^odoacetyl) aminobenzoate (SIAB). 

80. A primer, comprising all least about 20, preferably about 16 
bases of any of the sequence of nucleotides sequences set forth in SEQ 
ID NOs. 1-22, 24, 27-38, 41-86, 89, 92, 95, 98, lOl-l 10 1 12-123 

10 126, 128 and 129. ' '1^123, 

81 . The primers of claim 80 that is unlabeled, and optionally 
.ncludes a mass modifying moiety, which is preferably attached to the 

5'end. 

82. The method of any of claims 1-79, Wherein nucleic acid is 
1 5 immobilized to a solid support via a selectively cleavable linker. 

83. The methbd of claim 82, wherein the linker is 
thermocleavable, enzymatically cleavable, photocleavable or chemically 
cleavable. 

82. The method of claim 82, wherein the linker is a trityl linker 

83. The method of claim 82, wherein the linker is selected from 

the group consisting of 1-(2-nitro-5-(3-0-4,4'-dimethoxytritylpropoxy)- 

phenyl)-l-0-((2-cyanoethoxy)-diisopropylaminophosphino)ethane and 1- 

(4-{3-0-4,4'-dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)-1-0-((2- 
cyanoethoxy)-diisopropylaminophosphinp)ethane. 
25 84. A photolabile linker, comprising a compound of formula: 



30 



wo 98/20166 



PCT/US97/20444 



-351- 




(I) 



is selected from the group consisting of a/-(4,4'- 
dimethoxytrityloxy)alkyl and ««/-hydroxyalkyl; 
15 R2, i3 3^,3^,^^ ^^^^ g^^^p consisting of hydrogen, alkyi, aryl 

alkoxycarbonyl, aryloxycarbonyl and carboxy; 

R« is selected from the group consisting of hydrogen and 
(dialkylamino) (a/-cyanoalkoxy} P-; 
t is 0-3; and 

20. R- i3 selected from the group consisting of alkyl. alkoxy, aryl and 

aryloxy, 

85. The photocleavable linker of claim 84, wherein the linkers 
are of formula 11: 

25 




30 
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wherein: 

R** Is selected from the group consisting of to-(4,4'- 
dimethoxytrityloxylalkyl, a/-hydroxyaIkyl and alkyi; 

R^' Is selected from the group consisting of hydrogen, alkyI, aryl, 
alkoxycarbonyl, aryloxycarbonyl and carboxy; 

R" is selected from the group consisting of hydrogen and 
(dialkylamino)(a/-cyanoalkoxy)P-; and 

X^* is selected from the group consisting of hydrogen, alkyI or 

86. The photocleavable linker of claim 85, wherein: 
R^ is selected from the group consisting of 3-(4,4'. 
dimethoxytrityloxy)propyl, 3-hydroxypropyl and methyl; 

R^' is selected from the group consisting of hydrogen, methyl and 
1 o carboxy; 

R" is selected from the group consisting of hydrogen and 
(diisopropylamino)(2-cyanoethoxy)P-; and 

X^o is selected from the group consisting of hydrogen, methyl or 

OR , 

87. The photocleavable linker of claim 85, wherein: 
R** Is 3-(4,4'-dimethoxytrityloxy)propyl; 
R^' is methyl; 

R" Is (diisopropylamino)(2-cyanoethoxy)P-; and 
X^ is hydrogen. 

88. The photocleavable linker of claim 86. wherein; 
R** is methyl; 
R^' is methyl; 

R" is (diisopropylamino)(2-cyanoethoxy)P-; and 
X** is 3-(4,4'-dimethoxytrityloxy)propoxy. 



20 



25 



WO!>8AZ0166 



PCT/US97/20444 



Hi: 



-353- 

88. A photocleavable linker, comprising a compound of formula 




(III) 



10 ^^^-Qp23 



1 5 wherein: 

R" is selected from the group consisting of hydrogen and 
(dialkylamino) {a;-cyanoalkoxy) P-; 

R^" is selected from a;-hydroxyalkoxy, £«^-(4,4'- 
dlmethoxytrityloxy)alkoxy. <.^hydroxyalkyl and w-{4,4'- 
20 dimethoxytrityloxy)alkyl, and is unsubstituted or substituted on the a.kyi 
or alkoxy chain with one or more alkyi groups; 
r and s are each independently 0-4; and 
R*** is alkyI, alkoxy, aryl or aryloxy. 

89. The photocleavable linker of claim 88, wherein: 

25 is a,-hydroxyalkyl or a;-(4,4'-dimethoxytrityloxy)alkyl, and is 

substituted on the alkyI chain with a methyl group. 

90. The photocleavable linker of claim 88, wherein: 

R" is selected from the grouR consisting of hydrogen and 
{diisopropylamino)(2-cyanoethoxy)P-; and 
30 R« is selected from the group consisting of 3-hydroxypropoxy 3- 

(4,4'-dimethoxytrityloxy)propoxy, 4-hydroxybutyl. 3-hydroxy-1 -propyl 1 
hydroxy-2-propyl, 3-hydroxy-2-methyl-1 -propyl. 2-hydroxyethyl 
hydroxymethyl, 4-(4,4'-dimethoxytrityloxy)butyl, 3-(4,4'- 
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dimethoxytrltyloxy)-1 -propyl, 2-(4,4'-diniethoxytrityloxy)ethyl, 1-(4 4-. 
dimethoxytrityloxy)-2-propyI, 3-(4,4'-dimethoxytriyIoxy)-2-methyI-1- 
propyl and 4,4'-drmethyoxytrityloxymethyI. 

91 . The photocleavable linker of claim 90, wherein r and s are 
both O. 

92. The photocleavable linker of claim 91, wherein: 
R^' is (dilsopropylamino)(2-cyanoethoxy)P-; and 

R^" is selected from the group consisting of 3-(4,4'-dimethoxy- 
trityloxy)propoxy, 4-(4,4'-dimethoxytrityloxy)butyl, 3-{4.4'-dimethoxy- 
trityloxy)propyI, 2-(4.4'-dimethoxytrityloxy)ethyl, 1 -(4,4'-dimethoxy 
trityloxy)-2-propyl, 3-(4,4'-dimethoxytriyloxy)-2-methyl-1 -propyl and 4,4'- 
dimethyoxytrityloxymethyl . 

93, The photocleavable linker of claim 92, wherein: 
R^* is 3-(4,4'-dimethoxytrityloxy)propoxy. 

94. The photocleavable linker of claim 84, where in the linker is 
selected from the group consisting of 1-(2-nitro-5-(3-0-4,4'-dimethoxy 
tritylpropoxy)phenyl)-1-0-((2-cyanoethoxy)-diisopropylaminophosphino)- 
ethane and 1-(4-(3-0-4.4'-dimethoxytritylpropoxy)-3-methoxy-6-nitro- 
phenyl)-l-0-((2-cyanoethoxy)-diisopropylaminophosphino)ethane. 
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SEQUENCE OF THE AMPURED 209 bp PCR-PRODUCT OF THE /S-GLOBIN GENE 



FORWARD PRIMER: /S2 
CATTTGCTTC TOACACAACT GTGTTCACTA GCAACCTGAA ACAGACACCA 
12mer PRIMER 

TGQiw^l GaciCCTGTG GAGAAGTCTG CCGTTACTGC CCTGTGGGGC 
AAGGTGAACG TGGATGAAGT TGGTGGTGAG GCCCTGGGCA GGTTGGTATC 
AAGCTTACAA GACfJSGTTTA AGGAGACCAA TAGAAACTGG GCATGTGGAG 
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5' -GTGTGT6TGTGTGTGTGTTTTT (TT) (IT) AACAGGGA7TTGGGGAATTATrrGAGA-3' 
PWMER TTCTCCCTAAACCCCTT (4448.0) 

15 AliELE CAAAM TTGTCCCTAAACXXm (6890.6) 
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date. Whichever time Emit expves later. It should t>e noted, however, that the amendments wiO be considered 
as having t>een received on time if they are received t>y the Intemational Bureau after the expiration of the 
applicable tirrte Emit but before the completion of the technical preparations for intemational putslication 
(FUto46.1). 



Where not to file the amendments? 

The amerximents may only t>e filed with the Intemational Bureau and not with the receiving Office or the 
■ Intemational Searching Authority (Rule 46^. 

^ Where a demand for intemational preGminary examination has tDeenAs filed, see t>elow. 



tiow? Gttier tvy canoeCEng one or more enC^ claims, by adding one or more new claims or by amerKfing the text of 

one or more of ttie claims as ffled. 

ArepCaoement sfieet must t>e sutxnitted for each sheet of the claims wtuoh, on account of an amendment or 
amendments, differs from the sheet origmally filed. 

Afl the claims appearing on a reptaoement sheet must t>e numbered in Aratxc riumerats. Where a claim is 
cancelled, rK> renumtsering of the other claims b required. In afl cases where dai ms are renumt»ered, they must 
be renumt>ered consecutively (Administrative Insfauctions, Section 205(b)). 

The amendments must be made In the language in which the International application Is to be published. 



What documents must/may accompany tt>e amendments? 
Letter (Section 20S(b)): 

The amendments must be submitted with a letter. 

The letter win not be putsGshed with the intemational appfication and the amer>ded claims. It should not t>e 
confused with tfie "Statemertt under Article 1 9(1)' (see t>elow. under "Statement under Article 1 9(1 )*). 

The letter must be In English or French, at the choice of the applicant. However, If the language of the 
Intemational application Is Engfish, the letter must be in Engfish; if the Unguage of the intemational appfication 
Is French, the letter must t>e In French. 
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NOTES TO FORM PCT/ISA/220 (continued) 



The letter must indicate the differenoes t>etween the claims as filed and the claims as amended, ft must, in 
particular, v^ficate, in connection with each daim appearing in the intemational appficatton (it t>eing understood 
that identical indications concerning several claims may be grouped),whether 

d) the daim is unchanged; 

(B) the daim is cancelled; 

(iS) the daim is new; 

(iv) the daim replaces one or more daims as filed; 

(v) the daim is tlie resuft of the division of a daim as filed. 



The foNowing examples Illustrate the manner in which amendments must foe explained in the 
accompanying letter: 

1. (Where originally there were 48 datms and after amendment of some daims there are 51]: 
X^ms 1 to 29, 31 , 32, 34, 35, 37 to 48 replaced by amended claims t>earing the same numliera; 
daims 30, 33 and 36 unchanged; new daims 49 to 51 added." 

2. (Where originady there were 1 5 daims and after amendment of all daims there are 11]: 
X^laims 1 to 15 replaced by amended daims 1 to 1 1 .* 

3. (Where originally there were 1 4 datms and the amendments consist in cancelling' some daims and in ad<£ng 
new daims]: 

X)laims 1 to 6 and 14 unchanged; daims 7 to 13 cancelled; new daims 15, 16 and 17 added." or 
X:taims7to 13 cancelled; new daims 15, 16 and 17 added; all other daims unchanged.* 

4. (Where various kir>ds of amendments are made]: 

"Claims 1-10 unchanged; daims 11 to 13, 18 and 19 cancelled; daims 14, 15 and 16 replaced by amended 
daim 14; daim 17 sutxSvided into anriended daims 15, 16 and 17; new daims 20 and 21 added.* 



^Statement under article 19(1)" (Rule 46-4) 

The amendments may be accomparued by a statement explaining the amendments and in<ficatir>g any impad 
that such amendments might have on the descr^ition and the drawings (which cannd t>e amended under 
Artide1d(1)). 

The statement wSl be published with the intemational application and the amended dakns. 
It must be In the language In which the IntemaUonal appplication Is to be published. 
It must be brief, not exceeding 500 words if in Eng&sh or if translated into Eng&sh. 

It should not be confused with and does not replace the letter irKficating the differences t>ctween the daims 
es filed and as amended. It must be filed on a separate sheet and must t>e identified as such by a heading, 
preferably by using Ihe words •Statement urKler Aitide 1 9(1)-* 

It may not contain any tfsparaging comments on the international search report or the relevar>ce of citations 
contained in that report Reference to citations, relevant to a given dakn. contained in the intemational search 
report may be made only in connection with an amendment of that daim. 



Consequence If a demand for international prdimlnary examination has already been filed 

If, at tfie time of fifing any amendments ur>der Article 1 9, a demand for intemational preliminary examination 
has already l>een sutxnitted, the applicant must preferably, at the same time of filing the amendments with the 
International Bureau, also file a copy of such amendments with the International Preltrninary Examinir^ 
Authority (see Rule 62.2(a), first sentence). 



Consequence with regard to translation of the Intemational application for entry Into the national phase 

The applicant's attention is drawn to the fad that, wtiere upon entry into the national phase, a translation of the 
daims as amended under Article 19 may have to be furnished to the design ated/eleded Offices, instead of, or 
in ad(£tion to, the translation of the daims as filed. 

For further details on the requirements of each destgnated/Seleded Office, see Volume II of the POT Applicant's 
Guide. 
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This Inteoiational Search Report consists of a total of Ifl sheets. 
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1. Q Certain claims were found unseafxrfiable (see Box 0- 




(Xl ^^of invention is lacking (£ 


tee Box IQ. 





3' [X] The intemational application contains cRsdosure of a nucleotide and/or amino acid sequence listing and the 
intemational search was carried out on the basis of the sec^ence fisting 

fx] filed with the ffitemational appCcation. 

I I furnished tiy the applicant separately from the intemational application, 

I I but not accompanied by a statement to the effect that it did not include 
— matter goir^ beyond the disclosure in the intemational application as filed. 

I I Transcritjed by this Authority 



4. With regard to the title, the text is approved as submitted by the appGcant 

( I tfw text has tjeen established by thb Authority to read as follows: 



5. With regard to the abstract. 



fy] the text fe approved as 6Ut>mtttedt>y the appficant 

I I the text has been estatiished, aooonfing to Rule 38.2(b), by this Authority as it appears in 
— Box III. The applicant may, within one month from the date of mailing of this International 
Search Report, submit comments to this Authority. 



6. The figure of the drawings to k>eput3&shed with the abstract is: 

RguneNo. Q as suggested tiy the applicant. 

I I because the applicant fafled to suggest a figure. 

I I t>ecausff tins figupB better characterizes the invention. 



fyl None of tfie figures. 
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I Observations where certain claims were found unsearchable (Continuation of Hem 1 fHrst sheet) 



TWs hrtemational Se^ Report has not been estabfished in respect of certain claims under Article 17(2){a) for the following reasons: 
n Si^^eyrelaletosutiectmatterno.re<,uiredtobesearchedbythbAuthority.nan«^^ 



^ S^S^eVrelaletopartsoftheUrtematiooalApplicaticnthatdonoto^^ 

anexlent thil no meartngfu! Intemational Search can be earned out. speofically: 



Q are dependent dairns and not *a«ed in accordance with the second «nd*irds^ 



Box II Observations where unity of Inv ention is lacking (ConCnuagon of Hern 2 of first sheet) 

TWs Intemafional Searching Authority found multiple inventions in this intemational appncation. as foBcws: 

SEE ANNEXES 



1 . m As all required additional search fees were ti'i^I^ly paid by the applicant, this Intemational Search Report covers eSt 

searchable daims. 

2. rt As an searchaWe daims couW be searched without effort justifying an additional fee, this Authority did not invite payment 
' ' of any addHional fee. 



3 n Asonlysomeoftherequiredadditionalsearchfeesweretmelyp^ 

I I covers only those daims lor whidi fees were paid, spedficaHy daims Nos.: 



4 n No required adcStionalseard, fees were ernetypak* by the 

' ' restrSed to the invention first mentioned in the daims; a is covered^ 



Recnartc on Protest 



JX] The ad<Stional search fees were aooonyjanied by the appBcant'spro^ 
j j No protest aooon^panied the payrnertt of adtfitiofial oeardi fees- 
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^-orf that "claims 82-83" as 
- snculd .e Z cxa..s 3. and two claims 

^:^^rcrwer;-- ^^^^^ " 

eLnd ciaim 8. an. a second oXa.m 8.) 



...Claims 1-18, partially 82-83: 



• ^^<, the sequence ot a target 
. .ecnod ror ^^-^^^^^^^^^^^^.e^.^cion of base speciti- 
nucleic acid involvxng t:he gene. 
^^My terminated fragments. 



• ic,_34 partially 82-83: 
Claims ly-J**' 



• . taraet nucleic acid present in 
. ™etnod tor ^^^^^^ ^ nested polymerase chain 
a biological sample based 
^^^pUiication reaction. 

-ha- relates to the detec- 
3. Claim 35 partiaiiv » - - .detecting telomerase) . 

n^opiasia/maiiqnancies Jy - 

cxax^.s 36 and 37, ana p..- 

.or detection of neoplasia/maiignancies 

An assay .or a extension of a substrate 

•oased on telomerase ^^^^^^^ ^^^^^.^^ telomerase 
p.iruer and a subsequent amplification 
specific extension product by PCR. 

, de<-ecT;ion of neoplasia involving 

V-VZ In-au^is =: »u.an. or -iW-tVPa alleles .V 

LLnsIon .ea^ion a Sanger .ype se<^enc.ng 



• o^^^^aiiv (in that it relates to the detec- 

5. C— n« -1= P ,,i<,n,nc'es by detecting expression 

tion of n-P^-=-'»^';^::"I; , specific tissue type,. 

Claims 40 and 41, ana p<~ 

■ =,«=«av for the expression of the 
An arapiiticacion basea as^ay for c y 

. oene in bone marrow cells as 

tyrosine hydroxylase gene 

indicative of a neuroblastoma. 

.3. Claim 42. partially claims 82-83: 

. ....oo .or d.rec.ly defecting double stranded nucleic 
acid using Maldi-roF mass spectrometry. 

7. Claims 43-4'., partially claims 82-83: 

^nar^ra ONA relatedness by amplification 
A maTihcd rcr comparing un^ 
■ .. i^re DNA repeat sequences. 

ts. .;-aim lu, partially claims ei-S-i: 

. ,.^r=a tor detecting ..utations i^ased on target ampli- 
■t.cs^^on using a primer that introduces a unique 
t;:::;uoiease restriction site into amplified target and 
^ coaoinatlon of a Sanger sequencing protocol and 
.sr.donuciease digestion. 

^. Claim 47. parrially claims 82-S3: 



and detection of a 
esis of RliA using c 
crimerc7ntaining a RNA polymerise promoter sequence. 



. ^echcc for the amplification 

" ^ c:wnth««i5 of taiA using a 

nuclei;; acid based on the syntn— — 



iO. Ciai^is 43. 49, 80 



and 81. partially 82^83: 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



Primers per se for mass spectrometry comprising a mass 
modifying mciety. 

il. Claims 50-64, partially 68-70, pari^ially 73-79, 
partially claims 82-83: 

MsTihcci ror dececting a target nucleic acid sequence 
involving hybridisation to a detector oligonucleotide • 

1^. Claims 65-67, partially 68-70, 71-72, partially 
7:^-79, partially claims 82-83: 

Me'::hcds for determining a nucleic acid sequence involv- 
ing eMonuciease digestion. 

13, Clai-ns S4-94: 



Fhotoxabile linkers per se for use in immobilisation of 
nucleic acids to solid supports • 
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Y 
A 




2-7 




see p39^ "» nne ao paye xo, line 


1 1 -Iff 
XX— xo. 




Cioinis Tigure ^ 


ft? ft^ 


Y 
A 


ns/iA 1 CI 1 ^ no "^A — n;)np 1A linp If)* 

see po^e iine oh pai^c xo» imc au. 


Xi7 — , 




examples 5,8 




X 


see exanple 8 
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SYVANEN A -C ET AL: ■DETECTION OF POINT 
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MUTATIONS BY SOLID-PHASE METHODS" 
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HUMAN MUTATION, 






vol. 3, no. 3, 1 January 1994, pages 






172-179, XPe0968O258 






see the vfhole document 
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see abstract and claim 1 
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see abstract 
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TANG K ET AL: "MATRIX-ASSISTED LASER 
DESORPTION/IONIZATION OF RESTRICTION 
ENZYME-DIGESTED ONA" 

RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 
vol. 8, no. 2, February 1994, pages 
183-186, XP000608266 
see the whole document 




42.82.83 




Y 


CTcccDT r CT Ai - "Ma t ri v-a «t^d laser 

dltUCKI tl #\L. Plan lA— aaaiaucu lasci 

desorption/ionization Time^of -flight Mass 
Sectrometry for the detection of 
polymerase chain reaction products 
containing 7-Deazapurinr moieties* 
ANALYTICAL BIOCHEMISTRY, 
vol. 243. 1966, pages 55-65, XP002O72977 
see the whole document 
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see abstract, claims 1 and 2 
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see the whole document 
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see claims 1,19 
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see the whole document 

WO 97 42348 A (SEQUENOM INC) 13 November 
1997 

see claims 1-48 



84-87 



1.7-10 



page 5 of 5 



VI tWI I t t4_f~ ft i 



Ion on patent family members 



|, tniemationaf AppCicatlon No 



Patent document 


Pufalicalion 




-1 ] 

Patent f amiy 


Pubttoalion 


oiled in search report 


date 




memt>er(s) 


date 


wo 9629431 A 


26-09-1996 


us 


5605798 A 


25-02-1997 




AU 


5365196 A 


G8-10-1996 






CA 


2214359 A 


26-09-1996 






EP 


0815261 A 


G7-01-1998 



WO 9416101 A 21-07-1994 AU 5992994 A 15-08-1994 

CA 2153387 A 21-07-1994 

EP 0679196 A 02-11-1995 

JP 8509857 T 22-10-1996 

US 5547835 A 20-08-1996 

US 5605798 A 25-02-1997 

US 5691141 A 25-11-1997 



WO 9632504 



17-10-1996 



WO 9513381 



18-05-1995 



AU 


5544696 A 


30-10-1996 


EP 


0830460 A 


25-03-1998 


US 


5645986 


A 


08-07-1997 


US 


5629154 


A 


13-05-1997 


AU 


1178195 


A 


29-05-1995 


AU 


682082 


B 


18-09-1997 


AU 


1209095 


A 


29-05-1995 


AU 


1330795 


A 


29-05-1995 


AU 


6058298 


A 


04-06-1998 


CA 


2173872 


A 


18-05-1995 


EP 


0728207 


A 


28-08-1996 


JP 


9502102 


T 


04-03-1997 


WO 


9513382 


A 


18-05-1995 


US 


5648215 


A 


15-07-1997 


US 


5686306 


A 


11-11-1997 


US 


5639613 


A 


17-06-1997 


US 


5693474 


A 


02-12-1997 


WO 


9513383 


A 


18-05-1995 



. OE 4431174 


A 


07- 


-03- 


■1996 


NONE 


; GB 2260811 


A 


28- 


.04. 


■1993 


NONE 


WO 9617080 


A 


06- 


-06- 


■1996 


NONE 


WO 9515400 


A 


08- 


-06- 


-1995 


NONE 



Focm PCTASA/ZIO (patet« trnvHf •noex) (Jutr t992) 



page 1 of 3 









1 International AppHcatloo No 






^oa ofl patent family memtien 


; JS 97/20444 


Polefit dcxximent 




Publication 


Patent family 


PuUication 


cited in search report 




dale 


membeifs) 


date 


WO 961G648 


A 


11-04-1996 


AU 


3998195 A 


26-04-1996 






CA 


2118048 A 


31-G3-1996 


wo 9323563 


A 


25-11-1993 


AU 


4068293 A 


13-12-1993 






CA 


2135606 A 


25-11-1993 








EP 


0641391 A 


08-G3-1995 








JP 


8500725 T 


30-Q1-1996 




A 


02-05-1996 


NONE 






EP 6593789 


A 


27-04-1994 


JP 


5308999 A 


22-11-1993 






WO 


9323567 A 


2**-! 1 -IQQ'^ 


WO 9615262 


A 


23-05-1996 


AU 


3851495 A 


CkiZ CIC 1QQA 


« 




CA 


2205017 A 










EP 


0791074 A 


07 CkQ. 10Q7 


WO 8906700 


A 


27-07-1989 


AU 


3058589 A 


11-68-1989 






DE 


68908054 T 


19-83-1994 








DK 


463089 A. 


tw— Cf J — X^07 








EP 


0359789 A 










JP 


2006724 A 










.IP 


2563054 T 










NO 


300782 B 


2i-o/-iyy/ 


WO 8903432 


A 


20-04-1989 


US 


4962037 A 


69-19-1990 






CA 


1314247 A 


69-63-1993 








DE 


3854743 D 


11-61-1996 








DE 


3854743 T 


UiJ X^270 








EP 


0381693 A 










JP 


3502041 T 




US 5288644 


A 


22-02-1994 


US 


5453247 A 




. WO 9421822 

■s; 


A 


29-09-1994 


AU 


687801 B 


05-03-1998 






AU 


6411694 A 


11-10-1994 








CA 


2158642 A 


29-09-1994 








EP 


0689610 A 


03-01-1996 








JP 


8507926 T 


27-08-1996 








US 


5622824 A 


22-04-1997 



Fonn PGXflSA/210 ^atec« teniljr mnrmt^ (AKy 1932) 



page 2 of 3 



*^Uon on patent famlfy members 



P5 



•JS 97/20444 



Patent document 
cited in search report 


1 

1 


PubOcation 
dale 


Patent f anUfy 
membef<8) 


Publication 
date 






G4-G7-1995 


flic* 

us 


5258506 


A 


02-11-1993 








us 


5118605 


A 


02-06-1992 








US 


4775619 


A 










CA 


2088257 


A 


28-01-1992 








EP 


0543889 


A 


02-06-1993 








JP 


8311091 


A 


26-11-1996 

CW XX X^7U 








JP 


9031090 


A 


04-02-1997 








JP 


2552048 


B 


06-1 1-1996 

W Xx X^^v 








PL 


170146 


6 


31-10-19Q6 

^'X XU X7?U 








PT 


98488 


A»B 


29-05-1992 








WO 


9202528 


A 


20-02-1992 








us 


5545730 


A 


13-08-1996 

X*' W X77\l 


- 






us 


5578717 


A 


26-1 1 -IQQfi 

Cw Xx 








us 


5552538 


A 










us 


5367066 


A 


22-1 1-1QQ4 








AT 


133714 


T 


15-02-1996 

X«/ VC X?9U 








OE 


3854969 


0 


14-03-1996 








DE 


3854969 


T 


30-05-1996 








EP 


0360940 


A 


V"T V » X ^ ^ w 








EP 


0703296 


A. 


27-0:5-lQQfi 








ES 


2083955 


T 


WX X77VJ 








JP 


2092300 


A 










JP 


2676535 


B 


17-11-1997 








US 


5380833 


A 


10-01-1995 


WO 9531429 


A 


23-11-1995 


US 


5643722 


A 


01-07-1997 






AU 


2635995 


A 


05-12-1995 








CA 


2189848 A 


23-11-1995 








EP 


0763009 


A 


19-03-1997 








JP 


10500409 


T 


13-01-1998 


WO 9742348 


A 


13-11-1997 


AU 


3003497 


A 


26-11-1997 >^ 



Focn PCT/lSAy210 (pattern f«n«y anncnt) (Ju»y t932J 



page 3 of 3 



r- . icf ^ 



Improving Mass Resolution In MALDI/TOF Analysis of DNA 

Kai Tang, SEQUENOM. Inc.. 101 Arc* Streetr Boston. MA 02 UO. L'SA 

Tim Worrall and Robert J. Cotter, Middle Atlantic Mass Spcctronieirv- Labomiorv- The Johns 
Hopkins University School of Medicine, 725 N. Wolfe Street Baltimwre.* MD 21 205. USA 

Hubert K6ster, Faculty of Chemistry, Department of Biochemistry- and Molecular Bioloev 
Umversity of Hamburg, Martin-Luther-King-Platz 6. 20146 Hamburg.'GerTOany 

While the mass range for detection of oligonucleotides by MALDI TOl- hus been extended 
sigmficantly (I), mass resolution in the high mass range continues to be poor. The major 
problems appear to be low ionization/desorption efficiency and instahiUn- of molecular ions 
formed by the MALDI process. leading to losses of G. A and C bases through cleavuee of the N- 
glycosidic bond. When such fragmentation is meta.stable and occurs in tlic accelerating reaion. 
considerable peak broadening results, which cannot be corrected or ct>mpcn!;aied bv ion optical 
means. In titis report we demonstrate that by choosing proper matrix combinations both prompt 
and metastable fragmentation can be greatly reduced, resulting in mass resolutions of better tlian 
500 for single-stranded DNA up to a 40-mer in a linear TOF mass spccffiMneier. In addition, we 
have also found that considerably higher mass resolurion could bo obtained lor a 2^.mer using a 
low witage (5kV) instnuneat equipped with a cuned field rcflcctn)n (2). under conditions" in 
vjuch metastable fragmentation is reduced. Mass spectra ha\e been obtained for .se\eral mixtures 
of oligomers usuig these approaches, and suggest wa>s in uliish mass re.solution mielit be 
unproved for larger oligomers. 

(1) Tang. N.I. Taranenko. S.L. .Altaian. L.Y. Chang and CM. Chen. Rapid Commun Mass 
Specteom. 8 (1994) 727-730. 

(2) T.J. Cornish and R.J. Cotter, Rapid Commun. .Mass Specirom. 8 ( 1 994) 7X I -785. 



